PREFACE



This book contains a simplified version of a lecture course delivered 
over several years in the University of Glasgow. In it I have adopted 
the standpoint of E. T. Jaynes, that statistical mechanics is concerned 
with making the best of situations where we must forecast the be- 
haviour of physical systems given only partial information about 
their condition, using the quantitative measure of information 
developed by Shannon. 

One or two good books based on this approach already exist. I 
have tried here to give a clear account of a range of important physi- 
cal results while using no mathematics beyond quite elementary 
algebra and analysis. A few standard results in analysis and quantum 
theory are used although their proof requires more mathematical 
technique than does this book itself. These results are easy to under- 
stand and use, and this book can certainly be studied with no more 
than an elementary knowledge of quantum physics. 

Taylor's expansion, and iteration, are the most convenient mathe- 
matical crowbars available to the working physicist for prying open 
problems. They appear repeatedly through the work. 

I have learnt from many books and colleagues. Books by Schrödinger,
Tolman and Khinchin I have found particularly important.
Drs W. K. Burton and R. O. Davies have offered frequent and fruit- 
ful criticism, which I have respected, and advice, which I have 
occasionally accepted. Blame for the remaining errors and obscuri- 
ties must be all my own. 



INTRODUCTION 



Thermal physics has two main divisions, one concerned with the 
static, equilibrium properties of matter, the other concerned with the 
rates of processes. In systems which are inhomogeneous in space — 
containing variations of temperature or concentration, for example — 
transport processes such as heat flow and diffusion occur: in homo- 
geneous systems not in an equilibrium state, such as a magnetized 
paramagnetic substance just after the removal of the polarizing field, 
pure relaxation processes occur. 

With increased understanding of the properties of atoms and 
molecules, it has become possible in the realm of statistical physics to 
understand the relation between the bulk properties of matter and 
the structure and behaviour of its atomic and molecular constituents. 
The term 'statistical mechanics' has by a historical accident (equi- 
librium theory, being easier, developed faster) been reserved for the 
theory of equilibrium properties, while 'rate theory' or 'kinetic 
theory' describes the explanation of time-dependent effects. 

The prototype of statistical physics is the kinetic theory of gases, 
developed through the latter part of the nineteenth century, especially 
by Maxwell, Clausius and Boltzmann. This theory gives at an elemen- 
tary level an account of the ideal gas, and an outline of transport 
processes in real gases, in terms of the concepts of the mean energy, 
and the mean free path, of gas molecules. 

One of its most important results is the Maxwell-Boltzmann velocity
distribution law, that the number of molecules with speeds in the
range $c$ to $c + dc$ is proportional to $c^2 \exp(-mc^2/2kT)\,dc$,
and its extension, to add potential energy to the kinetic energy in
the exponent. In this expression $m$ is the molecular mass, $T$ the
absolute temperature, and $k$ the general constant named after
Boltzmann.

One of Boltzmann's major contributions was his identification of
the thermodynamic function, the entropy, with a function of the
dynamical condition of the system. At the turn of the century,
Willard Gibbs published a further great clarification of the whole
subject of equilibrium statistical mechanics.

Two of the results of Boltzmann and Gibbs can be very simply 
stated by reference to quantum states. They will be derived, and used 
repeatedly, in the body of this book, but are worth giving here. 

The Gibbs canonical distribution is defined as follows: 

Let the $i$th energy state of a system have energy $\epsilon_i$, and
let the system be in thermal equilibrium at absolute temperature $T$:
then the probability that the system should be found by experiment to
be in a state of energy $\epsilon_j$ is to its probability of being in
a state of energy $\epsilon_i$ in the ratio
$\exp[(\epsilon_i - \epsilon_j)/kT]$.

More loosely, the probability that the system is in state $i$ is
proportional to $\exp(-\epsilon_i/kT)$.

Problem: (a) Is this identical with saying that the probability that
the system has energy $\epsilon_i$ is proportional to
$\exp(-\epsilon_i/kT)$? (b) A particular atom has one state of zero
energy and three states all on the same energy level at
$k \times 1500$ K. At what temperature (measured in K) are the two
levels equally populated?
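The distinction raised in part (a), between the probability of a single state and the probability of an energy level containing several states, can be explored numerically. The following is a minimal sketch in Python, assuming the four-state atom of part (b); the function names and the variable `ATOM_LEVELS` are introduced here for illustration only and are not part of the original text.

```python
import math

def canonical_probabilities(energies_over_k, temperature):
    """Gibbs probabilities for states with energies eps_i/k (in kelvin)
    at a positive absolute temperature (in kelvin)."""
    weights = [math.exp(-e / temperature) for e in energies_over_k]
    z = sum(weights)  # canonical partition function
    return [w / z for w in weights]

# Assumed model: one state at zero energy, three states at eps/k = 1500 K.
ATOM_LEVELS = [0.0, 1500.0, 1500.0, 1500.0]

def upper_level_population(temperature):
    """Total probability of the upper LEVEL, i.e. the sum over its three states."""
    probs = canonical_probabilities(ATOM_LEVELS, temperature)
    return sum(probs[1:])

# The levels are equally populated when 3 exp(-1500/T) = 1, i.e. T = 1500 / ln 3.
t_equal = 1500.0 / math.log(3.0)

if __name__ == "__main__":
    print("T for equal level populations (K):", t_equal)
    print("upper-level population at that T:", upper_level_population(t_equal))
```

Note that at this temperature each individual upper state is still less probable than the ground state; it is only the level as a whole that matches it.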

Boltzmann's theorem on the entropy may be stated thus:
If the probability of occupation of the $i$th state of a system is
$p_i$, the entropy $S$ of the system is given by the sum over all
states

$$S = -k \sum_i p_i \ln p_i,$$

where ln denotes the logarithm to base e.

Problem: On the (unphysical) assumption that the atom described
in (b) above has only the four states cited, what is its entropy,
assuming the Gibbs ratio of the $p_i$, at the temperature defined in
(b), at $T = 0$, at $T = \infty$ and at $T = -\epsilon$, where
$\epsilon$ is arbitrarily small?
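One way to check an answer to this problem is to evaluate the entropy sum directly. The sketch below is an illustration under stated assumptions: the limits $T \to 0$, $T \to \infty$ and $T \to 0^-$ are approached with the arbitrary stand-in values 1e-3, 1e12 and -1e-3 kelvin, and all names are hypothetical.

```python
import math

def gibbs_probabilities(energies_over_k, temperature):
    """Canonical probabilities for energies eps_i/k (kelvin) at a nonzero
    temperature (kelvin); negative temperatures are accepted."""
    exponents = [-e / temperature for e in energies_over_k]
    largest = max(exponents)  # subtracting this keeps exp() from overflowing
    weights = [math.exp(x - largest) for x in exponents]
    total = sum(weights)
    return [w / total for w in weights]

def entropy_over_k(probs):
    """S/k = -sum p_i ln p_i; states with p_i = 0 contribute nothing."""
    return -sum(p * math.log(p) for p in probs if p > 0)

# Assumed model: one state at zero energy, three states at eps/k = 1500 K.
ATOM_LEVELS = [0.0, 1500.0, 1500.0, 1500.0]
T_EQUAL = 1500.0 / math.log(3.0)  # temperature of equal level populations

# Stand-in temperatures for the four cases named in the problem.
cases = {
    "equal populations": T_EQUAL,
    "T -> 0 (from above)": 1e-3,
    "T -> infinity": 1e12,
    "T -> 0 (from below)": -1e-3,
}
entropies = {
    label: entropy_over_k(gibbs_probabilities(ATOM_LEVELS, t))
    for label, t in cases.items()
}

if __name__ == "__main__":
    for label, value in entropies.items():
        print(label, "S/k =", value)
```

The shift by the largest exponent does not change the probabilities, since it cancels between numerator and denominator; it is there only so that small negative temperatures can be handled.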

It is clearly probable that for a system in thermal equilibrium the
function $Z = \sum_i \exp(-\epsilon_i/kT)$, which is called the
canonical partition function, will have properties of interest. We
shall see that it is indeed of primary importance in the theory.

A central problem in establishing the theory is to obtain the Gibbs 
canonical distribution. As usual in theory, several approaches are 
possible. The simplest, and in many ways the most sensible, is to 
announce that this is the primary axiom of the theory, to be postu- 
lated and not derived. The utilitarian justification of this axiom is the 
correspondence of the theory with classical thermodynamics.

The Gibbs distribution is so specialized in appearance that one is
tempted to find simpler axioms which produce it as a theorem. In the 
form of Maxwell-Boltzmann distribution, it has been deduced 
repeatedly by dynamical arguments, all of which on careful examina- 
tion reveal an assumption of randomness not noticeably simpler than 
the conclusion. For particular systems, it can be derived from the 
second law of thermodynamics, by the impossibility of a perpetual 
motion of the second kind. 

An alternative and very attractive approach comes from the appearance
of a function, closely related to Boltzmann's form for the entropy, in
the theory of communication engineering. Engineers concerned with the
economic problem of efficient coding of messages developed a
quantitative expression for the mean information transmitted in a
given class of message. This is given by $-\sum_i p_i \ln p_i$, where
$p_i$ is the probability of occurrence of the $i$th message in the
class. The mathematical theory of communication was almost entirely
originated by Shannon. Evidently in this theory 'information' is made
a term of art, with a restricted meaning, just as was done with 'work'
and 'energy' in the past.

As we shall see, Shannon's definition of information enables us to
assess the information content of a stated set of probabilities $p_i$
for the occurrence of distinct events $E_i$. This leads to a definite
policy of maximum open-mindedness. Given partial information about a
situation, and a demand to assess the probabilities of its distinct
possible states, we are to choose that set of probabilities which adds
the least arbitrary information (in Shannon's sense) to what is given.

Some cautions are perhaps needed. First of all, the information
content of a message is quite independent of the importance of its
contents. It measures only its surprise value relative to the other
members of the class of possible messages. Second, maximum
open-mindedness is not always the best policy, in a more general
sense. Best accuracy in medical diagnosis is indeed obtained by
suspending judgment, and treatment, till the patient dies, then
conducting a really careful post-mortem. There is then, however, no
prognosis. Third, the communication literature often calls 'entropy'
the function which we would obtain by taking $k = -1$, or
$-1/\ln 2$. The change of sign is sometimes confusing.
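To make the third caution concrete, the sketch below compares the communication-theory 'entropy' in bits, $-\sum_i p_i \log_2 p_i$, with the sum $\sum_i p_i \ln p_i$ multiplied by $-1/\ln 2$. It is only an illustration; the function names `entropy_bits` and `information_measure` are assumptions introduced here.

```python
import math

def entropy_bits(probs):
    """Communication-theory entropy, -sum p log2 p, measured in bits."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

def information_measure(probs):
    """The sum of p ln p (natural logarithm); it is never positive."""
    return sum(p * math.log(p) for p in probs if p > 0)

fair_coin = [0.5, 0.5]          # two equally likely messages
eight_equal = [1.0 / 8.0] * 8   # eight equally likely messages

if __name__ == "__main__":
    for probs in (fair_coin, eight_equal):
        rescaled = (-1.0 / math.log(2.0)) * information_measure(probs)
        print(entropy_bits(probs), rescaled)
```

The two printed columns agree, which is all the choice $k = -1/\ln 2$ amounts to: a change of sign and a change of logarithmic base.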

E. T. Jaynes, whose arguments are followed here in a simplified 
form, showed how an exponential distribution corresponding to that 
of Gibbs can be obtained from a simpler statistical question. A very 
useful bonus is the establishment that the form for the entropy given 
above is reasonable, even when the condition of the system does not
correspond to a thermal equilibrium. One or two good texts,
particularly the large book by Tribus, have been based on these
arguments.

The present book starts with an introduction to information 
theory, leading to the establishment of the canonical distribution and 
the connection between statistical and thermodynamic functions. 
We then work through the statistical mechanics of a series of simple 
systems. These make useful structural elements in more elaborate 
models, approximating to the behaviour of real objects. 

The grand canonical distribution is introduced at the earliest stage 
at which it can be used with profit. Once this step is taken, we have 
much more freedom of action in tackling problems with many 
similar particles. 

The Ising problem in one dimension is discussed at some length, 
and this concludes the set of topics which are precisely dealt with by 
elementary technique. 

Most many-body problems can only be approached by fairly 
brutal approximations even with powerful methods. Using elemen- 
tary methods, however, we obtain a number of interesting results, 
and the lattice gas provides a tractable and useful model for gases, 
liquids and solid alloys. The central problem of many-body theory 
is to give a reasonable account of phase changes, and the discussion 
of ordering on lattices gives some insight even under the very 
simplified quasi-chemical approximation. 

In the latter part of this book we are rather in the position of 
Stone Age tribesmen making sporadic hunting trips into forest 
country which can only be cleared and colonized by the technology 
of a more developed civilization. It may be some consolation to the 
reader that known exact results, giving much more physical informa- 
tion than we obtain in this book, almost all require mathematical 
technique well beyond the level of the normal undergraduate courses 
in physics and chemistry. The enormous literature of statistical 
mechanics is largely concerned with the elaboration of ideas which I 
have attempted to present here in a clear form, free of some of the 
technical difficulties which frequently obscure them. 



INFORMATION AND THERMODYNAMICS 



State and condition 

The problem of statistical mechanics, which is, historically, to 
construct a mechanical theory reproducing the results of thermo- 
dynamics and relating thermodynamic functions to molecular pro- 
perties, becomes in more general terms the question: 'Given certain 
partial information about the condition of a physical system, how 
are we to make the "best" prediction of the results of further 
measurements made on the system?' 

The answers to a particular class of such questions turn out indeed 
to be equivalent to thermodynamics — to these questions we confine 
our attention in this book. The more general statistical mechanics 
which follows this approach can in fact deal with problems other 
than those of thermal equilibrium, but that leads to topics no longer 
elementary. 

The state of a physical system is determined when we know its 
composition and the detail of its motion. By composition we mean so 
many particles, of such sorts, in such states of combination. Groups 
of particles whose mutual relation is unaltered during the whole 
period we are to study — stable molecules, for example — can be 
treated as entities of one independent type even if similar particles 
appear in another role elsewhere in the system. 

In classical mechanics, detailed specification of the motion of a 
system implies a knowledge of all positions and velocities of, and the 
forces of interaction between, particles so that a unique set of
positions and velocities is determined at any later time, given the
complete set of positions and velocities at one time.

In quantum mechanics, which physical systems do in fact obey,
it is meaningless to make a simultaneous specification of positions 
and velocities — more precisely, positions and the corresponding 
momenta — with arbitrarily great precision. The state of motion of a 
system is determined with maximum precision — as a so-called 'pure' 
state — by the exact prescription of the simultaneous values of as 
many physical variables as possible. The set of variables for a given 
system can, of course, be chosen in many different ways and often 
does not include the positions of the constituent particles; more com- 
monly the chosen variables are the energy and various momenta, in 
particular the total linear momentum and angular momentum of the 
system. 

The energy has a specially interesting role because in quantum 
mechanics energy and time are conjugate quantities, so that an exact 
statement of the energy of a system is incompatible with any state- 
ment of the time of an event involving that system: the states of the 
system which are constant in time are then the states of definite 
energy. Since we are concerned with steady (or slowly varying) 
conditions we find it specially convenient to use these states of 
definite energy, 'eigenstates' of energy in the jargon of quantum 
mechanics,¹ as elements in describing more general conditions of
the system. 

The description of such a general condition is easily visualized in 
terms of an ideal 'complete' experiment, to which many physical ex- 
periments indeed approximate. Suppose that many copies of the 
system are prepared with similar histories, so that we must accept 
them as being in the same condition. We then pass them through a 
complex spectrometer sorting them into groups possessing identical 
values of as extensive a set as possible of compatibly measurable 
quantities, so that each system in a given group has ended in the same 
pure state. Then the number of systems in the given group divided 
by the total number passed into the spectrometer measures (in the 
limit of large numbers) the probability that the system in the pre- 
scribed initial condition should be found in this particular pure 
state. The condition of the system is then described by specifying the
spectrometer (that is, a particular class of pure states) and the
corresponding set of probabilities.

¹ It is of course historically true that statistical mechanics was
developed without quantum theory and indeed long before it. Planck's
discovery of the quantum of action stemmed in fact from the failure of
classical statistics to account for the colour of thermal radiation.
However, classical statistical mechanics is extremely hard. We shall
base our treatment of statistical mechanics entirely on the quantum
picture just outlined, and fall back on classical mechanics only as an
approximation, very useful in treating a number of problems.

The condition of the system is not completely determined by the 
set of probabilities associated with this condition by an arbitrarily 
selected spectrometer. It is however true that associated with any 
possible condition of the system is some particular spectrometer, at 
least imaginable if not always practicable, determining a particular 
complete set of independent pure states; the condition is then com- 
pletely determined by the identification of this set of states and the 
specification of the corresponding set of probabilities. This result is a 
very important but not very elementary theorem of quantum 
mechanics. We shall be using a special case of it, namely, that any 
stationary condition is determined by specifying the probabilities 
that a system in this condition be found in its various possible pure 
states corresponding to definite values of the energy. 

Suppose that these energy eigenstates for a system of known
constitution are arranged in order of increasing energy with energy
values $\epsilon_0, \epsilon_1, \epsilon_2, \ldots, \epsilon_i, \ldots$.
If any group of distinct states all correspond to the same energy
value, they will be ordered within the group according to the value
$\lambda_i$ of some other dynamical variable compatible with the
energy. Let the probability of the occurrence of the $i$th state be
$p_i$. Then the expectation value of the energy of the system is given
as a sum over all possible states by

$$U = \sum_i p_i \epsilon_i$$

and is fixed when we know the $p$'s and the $\epsilon$'s.

It is interesting to ask by what criterion we should choose the set
of $p_i$ if, knowing the constitution of the system, and so the
$\epsilon_i$, we are given the value of $U$ and asked to predict the
expectation value of some other variable such as

$$L = \sum_i p_i \lambda_i$$

If some $\epsilon_j$ were just equal to $U$ we might choose $p_j = 1$,
$p_i = 0$ for all $i \ne j$, but this seems a very restricted and
special choice. Evidently the most sensible choice of the $p_i$ will be
that which adds the least amount of arbitrary additional information
to what information we already have.

The last sentence sounds very rational, but to make any practical 
use of it we require to establish a measure of information, at least for 
'the information carried by the set of probabilities associated with an 
exhaustive set of mutually exclusive events'. The mutually exclusive
events which interest us will of course be the appearance of the
system in the various independent pure states. 

Information 

We now look for a reasonable way of assigning a numerical value to 
the information carried by the set of probabilities associated with a 
complete set of mutually exclusive events — the outcomes of subject- 
ing a system to our ideal spectrometer. 

Evidently, the more information that is given in the specification 
of the condition of the system, the less remains to be found by experi- 
ment; in an extreme case, if we know in advance that one outcome is 
certain, that is, that it has probability unity and all other outcomes 
have probability zero, the experiment must give this outcome and so 
tells us nothing new. It is therefore natural to turn first to the ques- 
tion of evaluating the information gained by a trial, as complemen- 
tary to the information carried by the specification. 

Let us consider the complete (exhaustive and mutually exclusive)
set of events $E_i$ with the assigned probabilities $p_i$, and suppose
that on one trial the particular event $E_i$ has occurred. This event
is the more informative the less likely it was to occur, so the
information increases as $p_i$ decreases. It is hard to imagine that,
with really independent events, a change in the relative probability
of $E_j$ and $E_k$ makes any difference to the information we gain
from the occurrence of $E_i$. Accordingly, we propose that the
information gained from the one occurrence of $E_i$ should be
$I(p_i)$, where $I$ is a monotonic decreasing function of otherwise
unknown form.

To identify this function $I$ more closely, consider the result of
making two independent trials. The pairs of events $E_i$, $E_j$, which
are the possible outcomes of the double trial, themselves constitute a
complete set of events with probabilities $p_{ij}$, which are equal to
$p_i p_j$ since the events $E_i$, $E_j$ are independent. Thus the
occurrence of the particular pair of events $E_i$ and $E_j$ has given
us information $I(p_{ij}) = I(p_i p_j)$. It would be convenient if
information were so defined that this $I(p_{ij})$ were equal to the
sum of the amounts of information obtained by considering the
occurrence of $E_i$ and $E_j$ separately. (Inequality would then be
the indication that $E_i$ and $E_j$ were not in fact independent
events!) Thus we need to have

$$I(p_i p_j) = I(p_i) + I(p_j)$$

and we see that $I(p) = -c \ln p$, where $c$ is an arbitrary positive
constant (remember that $p \le 1$!), fits all the requirements we have
developed so far.

If then we consider a great number $N$ of independent trials, $E_i$
will occur $N p_i$ times and the average information gained per trial
is

$$I = -c \sum_i p_i \ln p_i$$

Obviously $I$ is zero if any $p_i = 1$ (all the other $p_j$ are then
zero). Let $M$ be the number of possible outcomes $E_i$. We have

$$\sum_{i=1}^{M} p_i = 1$$

so for any permissible set of variations $dp_i$ of the $p_i$ we must
have

$$\sum_i dp_i = 0$$

or

$$dp_1 = -\sum_{i=2}^{M} dp_i$$

Now

$$\frac{\partial I}{\partial p_i} = -c(1 + \ln p_i)$$

so the variation of $I$ corresponding to the set of variations $dp_i$
of its arguments is

$$dI = -c \sum_i (1 + \ln p_i)\, dp_i$$

If any particular set of $p_i$ corresponds to a turning value of $I$,
then $dI$ is zero for $dp_i$ starting at this point, so the condition
for a turning value of $I$ is

$$\sum_i (1 + \ln p_i)\, dp_i = 0$$

subject to (a) $\sum_i dp_i = 0$, which is evidently equivalent to
(b) $\sum_i \ln p_i\, dp_i = 0$, subject to the same auxiliary
condition.

The neat way to resolve this is by Lagrange's method. If $\alpha$ is a
constant (to be determined immediately) the relation

$$\sum_i \ln p_i\, dp_i + \alpha \sum_i dp_i = 0$$

i.e.

$$\sum_i (\ln p_i + \alpha)\, dp_i = 0$$

follows from (a) and (b). Suppose $\alpha$ is $-\ln p_1$, whatever this
may turn out to be. For this value of $\alpha$, the coefficient of
$dp_1$ is identically zero, so the other $dp_i$ may now be any
imaginable set of variations. The fact that $dp_1$ is 'minus their
sum' drops out of the equations, and all the coefficients of the
independent $dp_i$ must now vanish. We see then that a turning value
of $I$ occurs when all the $p_i$ are equal and of value
$\exp(-\alpha)$. The value of $\alpha$ is determined by looking back
at the completeness equation

$$\sum_i p_i = 1$$

so $I$ has its turning point when each $p_i = M^{-1}$. It is easily
verified that this gives a maximum, and

$$I_{\max} = c \ln M$$

The most natural choice for the information carried by the set of
probabilities $p_i$ would then seem to be

$$H(p_1, \ldots, p_i, \ldots) = I_{\max} - I(p_1, \ldots, p_i, \ldots)$$

taking the situation of perfect ignorance, in which all the
alternatives seem equally likely, as the reference situation.
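The properties claimed for $I$ in this section can be checked numerically. The sketch below is an illustration only: it tests the additivity of $I(p) = -c \ln p$ for independent events, and compares the mean information of randomly drawn distributions with the value $c \ln M$ reached when all $M$ outcomes are equally likely. The choice $M = 6$, the random seed and all function names are assumptions made here.

```python
import math
import random

def information_gain(p, c=1.0):
    """Information I(p) = -c ln p gained when an event of probability p occurs."""
    return -c * math.log(p)

def mean_information(probs, c=1.0):
    """Average information per trial, I = -c sum p_i ln p_i.
    Outcomes with p_i = 0 contribute nothing to the sum."""
    return -c * sum(p * math.log(p) for p in probs if p > 0)

def random_distribution(m, rng):
    """A randomly chosen set of m probabilities summing to one."""
    weights = [rng.random() + 1e-9 for _ in range(m)]
    total = sum(weights)
    return [w / total for w in weights]

M = 6                       # assumed number of outcomes
uniform = [1.0 / M] * M     # the situation of perfect ignorance
rng = random.Random(0)      # fixed seed so the sketch is repeatable
samples = [random_distribution(M, rng) for _ in range(1000)]
largest_sampled = max(mean_information(p) for p in samples)

if __name__ == "__main__":
    print("I(0.2 * 0.5)      =", information_gain(0.2 * 0.5))
    print("I(0.2) + I(0.5)   =", information_gain(0.2) + information_gain(0.5))
    print("I for uniform p   =", mean_information(uniform), " c ln M =", math.log(M))
    print("largest I sampled =", largest_sampled)
```

A random search of this kind does not prove that the uniform distribution gives the maximum, but no sampled distribution should exceed $c \ln M$.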

However, we shall find ourselves dealing with a class of problems
in which $M$ is infinite, although the class of distributions $p_i$
will always be such as to make $I(p_1, \ldots, p_i, \ldots)$ finite.
To avoid having infinite constant terms in our mathematics, we then
choose the situation of perfect certainty, in which any one
$p_i = 1$, as reference situation, so that the measure of the
information¹ carried by a set of probabilities $p_i$ is

$$H(p_1, \ldots, p_i, \ldots) = \sum_i p_i \ln p_i$$

where we have taken the arbitrary constant $c = 1$.

'Least information' prescription: In sum, then, when given partial
information about a system which is to be described by a discrete set
of probabilities $p_i$, we choose that set of $p_i$ which makes the
sum $\sum_i p_i \ln p_i$ a minimum without becoming inconsistent with
the information supplied.

The canonical distribution 

Accepting the measure of information developed in the last section,
we find that the problem posed at the end of the previous section is in
sufficiently definite terms to be solved. It now runs as follows. Given
a set of states with energy levels $\epsilon_i$, and an expected energy

$$U = \sum_i p_i \epsilon_i$$

where $\sum_i p_i = 1$, all $p_i \ge 0$,

to determine the $p_i$ so that $\sum_i p_i \ln p_i$ is a minimum.

¹ We have now shown that the function H defined above gives a
convenient measure of information. It is not obvious that no other
measure is conveniently possible. The proof of this has been subjected
to a great deal of refinement by the professional mathematicians.
There is an excellent account in Khinchin (1957), and a more
economical but rather harder discussion, due to Fadiev, in Feinstein's
book (1958). Both these discussions require a certain amount of
mathematical sophistication from the reader: Feinstein, A.,
Foundations of information theory (McGraw-Hill, 1958); Khinchin, A. I.,
Mathematical foundations of information theory (Dover, 1957).

The neatest procedure is again that of Lagrange. A variation $dp_i$
of each probability away from the desired value gives

$$\sum_i (1 + \ln p_i)\, dp_i = 0$$

$$\sum_i dp_i = 0$$

$$\sum_i \epsilon_i\, dp_i = 0$$

so for some constants $\gamma$, $\beta$ we have

$$\sum_i (1 + \gamma + \ln p_i + \beta \epsilon_i)\, dp_i = 0$$

where the $dp_i$ may now be considered as completely independent
variations, so that

$$p_i = e^{-(1+\gamma)} e^{-\beta \epsilon_i} = \frac{e^{-\beta \epsilon_i}}{\sum_j e^{-\beta \epsilon_j}}, \quad \text{since } \sum_i p_i = 1.$$

This corresponds closely to the canonical distribution introduced by
Willard Gibbs.
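The minimum property of this distribution can be tested numerically. The sketch below is an illustration under assumptions chosen here (four equally spaced levels in arbitrary units, $\beta = 0.7$): it displaces the canonical probabilities along a direction that leaves both $\sum_i p_i$ and $\sum_i p_i \epsilon_i$ unchanged, and checks that $\sum_i p_i \ln p_i$ rises in either sense of the displacement. All names are hypothetical.

```python
import math

def canonical(energies, beta):
    """Canonical probabilities p_i = exp(-beta eps_i) / sum_j exp(-beta eps_j)."""
    weights = [math.exp(-beta * e) for e in energies]
    z = sum(weights)
    return [w / z for w in weights]

def h_measure(probs):
    """The information measure H = sum p_i ln p_i."""
    return sum(p * math.log(p) for p in probs if p > 0)

energies = [0.0, 1.0, 2.0, 3.0]  # assumed equally spaced levels, arbitrary units
beta = 0.7                       # assumed value of the multiplier
p_star = canonical(energies, beta)

# For equally spaced levels this direction satisfies
# sum d_i = 0 and sum eps_i d_i = 0, so both constraints are preserved.
direction = [1.0, -2.0, 1.0, 0.0]

def displaced(t):
    """Probabilities moved a distance t along the constraint-preserving direction."""
    return [p + t * d for p, d in zip(p_star, direction)]

h_star = h_measure(p_star)
excess = [h_measure(displaced(t)) - h_star for t in (-0.02, -0.01, 0.01, 0.02)]

if __name__ == "__main__":
    print("H at the canonical distribution:", h_star)
    print("H(displaced) - H(canonical):", excess)
```

Only one direction is probed here; a fuller check would sample every direction allowed by the two constraints.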
The constant $\beta$ is to be determined from the relation

$$U = \frac{\sum_i \epsilon_i e^{-\beta \epsilon_i}}{\sum_i e^{-\beta \epsilon_i}} = -\frac{\partial}{\partial \beta}\left[\ln\left(\sum_i e^{-\beta \epsilon_i}\right)\right]$$

This is in general a very intractable equation to solve for $\beta$
given $U$. We shall see that by good luck our physical problems can be
presented in such a form that we first state the value of $\beta$ and
then construct $U$.

We see moreover that

$$p_i = \frac{e^{-\beta \epsilon_i}}{\sum_j e^{-\beta \epsilon_j}} = -\beta^{-1} \frac{\partial}{\partial \epsilon_i}\left[\ln\left(\sum_j e^{-\beta \epsilon_j}\right)\right]$$

so that all the properties of the system, in a condition specified by
this set of $p_i$ for the energy eigenstates, can be predicted if we
know the function

$$Z = \sum_i e^{-\beta \epsilon_i}$$

as a function of $\beta$ and the $\epsilon_i$. This function $Z$ is
called the partition function for the system.

At first sight $Z$ is a function of many, perhaps infinitely many,
variables, and it is not obvious that it is going to be much use to us.
However, in the physically interesting cases, all the $\epsilon_i$ are
determined as functions of relatively few parameters, e.g. the volume
available to the system and the constants determining its internal
interaction, and once $Z$ is expressed as a function of these
parameters and of $\beta$ our problem is very much simplified.
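As an illustration of $Z$ depending on a single parameter, the sketch below uses a purely hypothetical family of levels, $\epsilon_n = n^2/\ell^2$ in arbitrary units, fixed by one length $\ell$ (the truncation at 50 levels is also an assumption). It then compares the mean energy computed directly from the probabilities with the value obtained from $U = -\partial \ln Z/\partial \beta$ by a central difference.

```python
import math

def partition_function(energies, beta):
    """Z = sum_i exp(-beta eps_i)."""
    return sum(math.exp(-beta * e) for e in energies)

def mean_energy(energies, beta):
    """U = sum_i eps_i exp(-beta eps_i) / Z, evaluated directly."""
    z = partition_function(energies, beta)
    return sum(e * math.exp(-beta * e) for e in energies) / z

def mean_energy_from_log_z(energies, beta, step=1e-5):
    """U = -d(ln Z)/d(beta), estimated by a central difference."""
    upper = math.log(partition_function(energies, beta + step))
    lower = math.log(partition_function(energies, beta - step))
    return -(upper - lower) / (2.0 * step)

def box_like_levels(length, n_levels=50):
    """Hypothetical levels eps_n = n^2 / length^2 (arbitrary units),
    all determined by the single parameter `length`."""
    return [n * n / (length * length) for n in range(1, n_levels + 1)]

levels = box_like_levels(length=2.0)
beta = 1.0  # assumed value, in units inverse to those of the energies

if __name__ == "__main__":
    print("Z                 =", partition_function(levels, beta))
    print("U (direct sum)    =", mean_energy(levels, beta))
    print("U (from d ln Z)   =", mean_energy_from_log_z(levels, beta))
```

Once the levels are generated from `length`, every quantity above is a function of `length` and `beta` alone, which is the simplification the text describes.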

In order to see the full beauty of the subject, we must now establish
the connection between the pure statistical mechanics we have just
been discussing and the rules of classical thermodynamics. First of
all, we note one or two further properties of $p_i$ and $U$.

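For given $\beta$, $p_i$ decreases as $\epsilon_i$ increases: the
system is more likely to be in its ground state than any other single
state, however large $U$ may be.

Again, given the set of $\epsilon_i$,

$$\frac{\partial U}{\partial \beta} = \frac{\partial}{\partial \beta}\left(-Z^{-1} \frac{\partial Z}{\partial \beta}\right)$$

$$= Z^{-2}\left[\left(\sum_i \epsilon_i e^{-\beta \epsilon_i}\right)^2 - \left(\sum_i e^{-\beta \epsilon_i}\right)\left(\sum_i \epsilon_i^2 e^{-\beta \epsilon_i}\right)\right]$$

$$= \left(\sum_i \epsilon_i p_i\right)^2 - \sum_i \epsilon_i^2 p_i$$

$$= \langle \epsilon \rangle^2 - \langle \epsilon^2 \rangle$$

$$= -\langle (\epsilon - \langle \epsilon \rangle)^2 \rangle$$

$$< 0$$

since the expectation value of the square of a real quantity is
necessarily positive, where $\langle x \rangle = \sum_i x_i p_i$ is the
expectation value of any function $x$ depending on the energy levels.
Thus $U$ is a monotonic decreasing function of $\beta$. (Evidently,
$\langle \epsilon \rangle$ is $U$.)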
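Because $U$ decreases monotonically with $\beta$, the equation for $\beta$ at given $U$, intractable in closed form, can at least be solved numerically by bisection. The sketch below is an illustration under assumptions made here (a small set of arbitrary levels, a search bracket of $0 \le \beta \le 50$); it also compares a finite-difference slope of $U$ with $-\langle (\epsilon - \langle \epsilon \rangle)^2 \rangle$. All names are hypothetical.

```python
import math

def canonical(energies, beta):
    """Canonical probabilities for beta >= 0, with the lowest energy
    factored out (it cancels in the ratio) to keep exp() well behaved."""
    shift = min(energies)
    weights = [math.exp(-beta * (e - shift)) for e in energies]
    z = sum(weights)
    return [w / z for w in weights]

def mean_energy(energies, beta):
    """U = <eps> = sum_i p_i eps_i."""
    probs = canonical(energies, beta)
    return sum(p * e for p, e in zip(probs, energies))

def energy_variance(energies, beta):
    """<(eps - <eps>)^2> for the canonical distribution."""
    probs = canonical(energies, beta)
    u = sum(p * e for p, e in zip(probs, energies))
    return sum(p * (e - u) ** 2 for p, e in zip(probs, energies))

def solve_beta(energies, target_u, lo=0.0, hi=50.0, iterations=200):
    """Bisection for beta such that U(beta) = target_u.
    Relies on U decreasing with beta: if U is too large, beta must grow."""
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if mean_energy(energies, mid) > target_u:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

levels = [0.0, 1.0, 1.0, 1.0, 2.5]   # assumed levels, arbitrary units
beta_true = 1.3                      # assumed value used to generate a target U
u_target = mean_energy(levels, beta_true)
beta_found = solve_beta(levels, u_target)

step = 1e-5
slope = (mean_energy(levels, beta_true + step)
         - mean_energy(levels, beta_true - step)) / (2.0 * step)

if __name__ == "__main__":
    print("beta recovered from U:", beta_found)
    print("dU/dbeta (finite difference):", slope)
    print("-variance of the energy:     ", -energy_variance(levels, beta_true))
```

The bracket must contain the solution: the target $U$ has to lie between the ground-state energy and the plain average of the levels, which is the value of $U$ at $\beta = 0$.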

Notions of thermodynamics 

Classical thermodynamics is a theory of the equilibrium states of 
physical systems; it gives a criterion for the direction in which a given 
system will move towards equilibrium, but says nothing about the 
rate of this movement. 

Two physical systems can interact in a variety of ways; in par- 
ticular, they may exchange energy or matter with each other. For the 
present, we consider situations in which they exchange energy alone. 
This may take place either by the transfer of radiation, or by mech- 
anical interaction at an interface. We idealize from experience to the 
notion that two systems which would otherwise interpenetrate and 
mix may be separated by a partition which transfers energy from one 
side to another, but is impenetrable to material particles. A practical 
approximation to a partition of this sort is a thin conducting metal 
sheet. A model of such a partition might be a perfectly elastic
membrane¹ which permitted particles on either side to exchange
mechanical energy, by simultaneous collision with the membrane, without
allowing them to pass through. 

If two systems are thus allowed to exchange energy and left to age, 
we find that they settle to an equilibrium condition, which we take as 
defining thermal equilibrium. We then discover that, given systems 
A, B and C, if A is in thermal equilibrium with B, and is then found 
to be in thermal equilibrium with C (that is, undergoes no change 
of condition on equilibrating with C, as just described), B will be 
found to be in thermal equilibrium with C. Thus systems in thermal 
equilibrium with each other share a common property, which we call 
temperature, and an empirical scale of temperature can be construc- 
ted (as, for example, in the ordinary mercury thermometer) in terms 
of some convenient measurable property such as the relative density 
of mercury and glass at nearly constant pressure. 

Calorimetric experiments then lead one to the idea of a quantity,
calorimetric heat, which is apparently conserved in an enclosed
system which is not the seat of a chemical reaction and does not
have substantial amounts of work done on it. The unit of calorimetric
heat is the calorie. The equivalence of heat and energy, which we have
presupposed in some of the foregoing remarks, is then established by
the universally constant ratio of energy lost to heat gained in all
sorts of dissipative processes.

¹ One can imagine such a membrane so light and stiff that a quantum of
energy at its lowest frequency of natural vibration would be much
larger than the thermal energy per degree of freedom in the systems.
The heat capacity of the membrane itself could thus be as small as one
wished. See the later discussion of the ideal insulating crystal
(pp. 34-38), and especially the conclusion about the specific heat at
very low temperatures.

We are thus led to the idea of the 'internal energy' of a system, 
which one might define as the sum of its heat energy and of the energy 
it possesses as a result of its macroscopic interactions with fields of 
stress, or of electric or gravitational force, and so on. The same idea 
can be reached, however, in a rather more subtle and satisfying way, 
as follows. 

Consider a system isolated by non-conducting partitions so that 
it cannot exchange heat with its surroundings, but suppose that work 
can be done on it, either mechanically, by displacing its boundaries, 
or electrically. If we now take any two states of internal thermal equi- 
librium of the system, it is possible to pass from one to the other (but 
not in general in the reverse direction) by performing work on the 
system in a variety of different ways. For example, if we deal with a 
simple fluid the thermal equilibrium state is defined by volume and 
temperature. Then to pass from one state to another, we may raise 
the temperature by violent stirring at the initial volume, drop the 
volume instantaneously to its final value at any suitable moment, and 
stir on to the final temperature, or combine stirring with volume 
variation in any arbitrary way. In whatever way an 'adiabatic' 
process of this sort is carried out, we find that the total work done in 
bringing the system from a given initial state to a given final state 
is the same. This is one statement of the first law of thermodynamics: 
the work done in the adiabatic transition then defines the difference 
in internal energy of final and initial states. 

There will be possible processes which are not adiabatic,
connecting the same two states. If the increase of internal energy in
the process is $\Delta U$ and the work done on the system¹ in one such
process is $-W$, we define the heat $Q$ supplied to the system by the
relation

$$Q = \Delta U + W$$

We thus have a thermodynamic definition of heat which is completely
independent of calorimetry.

Now consider the special (impracticable) class of processes in
which the system passes through a sequence of equilibrium states. In
such processes, called quasi-static, there would be no dissipation of
energy within the system; and these processes may, unlike the general
dissipative process, take place in the reverse direction. For the theory
of heat engines, a special interest attaches to the cyclic quasi-static
processes, in which the system returns to its initial state.

¹ That is, the work done by the system is $W$. The standard choice of
sign reflects the preoccupation of the men who invented thermodynamics
with heat engines.

By arguments which are best studied at length in books devoted to 
thermodynamics, one can establish, either by a consideration of 
reversible cycles or, more directly but with more difficulty, from the 
fact that arbitrarily close to a given equilibrium state of a system lie
equilibrium states inaccessible from the given state by any adiabatic 
process, the existence of the quantities called entropy and absolute 
temperature. 

The differential relation for a small quasi-static transition,

$$ dQ = dU + dW $$

possesses an integrating factor $T^{-1}$ such that

$$ dS = T^{-1}\,dQ = T^{-1}(dU + dW) $$

defines a function S of the state of the system, called the entropy.
The integrating quotient T is called the absolute temperature, and is
easily shown to coincide with the temperature that would be indi- 
cated by an ideal gas thermometer. Its most striking property is the 
existence of a physically meaningful 'absolute zero' of temperature. 
The existence of the integrating factor is trivial for a simple fluid, 
with only two thermodynamic degrees of freedom. This fact is often 
responsible for the suspicion felt by elementary students that thermo- 
dynamics is an empty subject. The result is not trivial for any more 
complex system, and the existence of a definite T for any equi- 
librium situation is what makes thermodynamics possible and useful. 

The connection of statistics with thermodynamics

To establish this connection we concentrate on the following ideas. 

(i) The internal energy of a system plays a fundamental role in 
thermodynamics. 

(ii) Equilibrium between two systems is an easier notion to lay 
hold of than equilibrium within one. 

(iii) The absolute temperature appears as an integrating quotient 
in the heat-work relation. 

We enquire now whether a system whose condition is as broadly
specified as possible, subject to the condition that the expectation
value $U_s$ of its energy is known, can be identified as a system in
thermal equilibrium, and, if so, how $U_s$ and $\beta$ are related to the
thermodynamic variables.
First of all, if the dispersion of the energy is small, that is, 

8p <K Us 

so that values of energy much different from $U_s$ are unlikely to be
found, we will be content to identify $U_s$ with the thermodynamic
internal energy U.

Let us suppose provisionally that this identification can be made in 
general. 

Now consider two distinct physical systems (1) and (2), with
states of energy $\alpha_{1,i}$, $\alpha_{2,j}$ respectively, and suppose first of all that the
expectation energies $U_1$, $U_2$ of these systems are separately specified.

Then the probability that system (1) is in its ith state is

$$ p_{1,i} = e^{-\beta_1 \alpha_{1,i}} / Z_1(\beta_1) $$

where

$$ Z_1(\beta_1) = \sum_i e^{-\beta_1 \alpha_{1,i}} $$

and moreover

$$ U_1 = \sum_i p_{1,i}\,\alpha_{1,i} $$

Similarly the probability that system (2) is in its jth state is

$$ p_{2,j} = e^{-\beta_2 \alpha_{2,j}} / Z_2(\beta_2) $$

where

$$ Z_2(\beta_2) = \sum_j e^{-\beta_2 \alpha_{2,j}} $$

and moreover

$$ U_2 = \sum_j p_{2,j}\,\alpha_{2,j} $$

We now think of these two systems as the separate parts of a single
joint system whose levels are $\alpha_{i,j} = \alpha_{1,i} + \alpha_{2,j}$ and whose states are
enumerated by taking all possible values for the pair of indices i, j.
The probability that the joint system is in the state i, j is now

$$ p_{i,j} = e^{-(\beta_1 \alpha_{1,i} + \beta_2 \alpha_{2,j})} / Z_1(\beta_1)\,Z_2(\beta_2) $$

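Alternatively, consider the situation where we only specify a total
expectation energy U for the joint system whose states i, j have
energy levels $\alpha_{i,j}$. Then we must have

$$ p_{i,j} = e^{-\beta \alpha_{i,j}} / Z(\beta) $$

where

$$ Z(\beta) = \sum_{i,j} e^{-\beta \alpha_{i,j}} = Z_1(\beta)\,Z_2(\beta) $$

and $\beta$ is determined by

$$ U = \frac{1}{Z(\beta)} \sum_{i,j} \alpha_{i,j}\,e^{-\beta \alpha_{i,j}} $$

This last sum can be separated as

$$ \frac{1}{Z}\left\{ \left(\sum_i \alpha_{1,i}\,e^{-\beta \alpha_{1,i}}\right)\left(\sum_j e^{-\beta \alpha_{2,j}}\right) + \left(\sum_i e^{-\beta \alpha_{1,i}}\right)\left(\sum_j \alpha_{2,j}\,e^{-\beta \alpha_{2,j}}\right) \right\} $$

$$ = \frac{1}{Z}\left\{ U_1(\beta) Z_1(\beta) \cdot Z_2(\beta) + Z_1(\beta) \cdot U_2(\beta) Z_2(\beta) \right\} = U_1(\beta) + U_2(\beta) $$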

We now compare this with the situation resulting when the two
systems are joined not merely conceptually but also physically, to
the extent that although the energy levels $\alpha_{1,i}$ and $\alpha_{2,j}$ are not disturbed
by the interaction, energy may nevertheless be exchanged between the
systems. In such a connection total energy is conserved, so we must
set $U = U_1(\beta_1) + U_2(\beta_2)$.

We observe first of all that if $\beta_1 = \beta_2$, then on bringing the systems
into contact there is no change in the condition of either of them.
Thus equality of $\beta$ is equivalent to equality of temperature. Moreover,
since $U_1$, $U_2$ and U are monotonic decreasing functions of $\beta$,
we have the situation illustrated, for unequal $\beta_1$ and $\beta_2$, where $\beta_1$, $\beta_2$
may be interchanged, but $\beta$ must always lie between them. Since

$$ U(\beta) = U_1(\beta) + U_2(\beta) = U_1(\beta_1) + U_2(\beta_2) $$

the system which begins with the lower value of $\beta$ will decrease its U
value as it equilibrates with the other; that is, the system with the
lower $\beta$ will give up energy to the other, so that the lower $\beta$ corresponds
to higher temperature.

Thus, $\beta$ is so far identified as an inverse measure of temperature,
the same $\beta$ value corresponding to the same temperature no matter
what the nature of the particular physical systems we are considering.
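
The argument above is easily tried out numerically. The following is only a minimal sketch under stated assumptions: the two level schemes and the starting values of $\beta$ are arbitrary inventions, and the common $\beta$ is found by simple bisection on the conserved total expectation energy.

```python
import math

def mean_energy(levels, beta):
    """Canonical expectation energy U(beta) = sum(a * exp(-beta * a)) / Z."""
    shift = min(levels)  # factor out the lowest level; it cancels in the ratio
    weights = [math.exp(-beta * (a - shift)) for a in levels]
    z = sum(weights)
    return sum(a * w for a, w in zip(levels, weights)) / z

def common_beta(levels1, beta1, levels2, beta2, tol=1e-12):
    """Bisect for the beta at which U1(beta) + U2(beta) equals the conserved total."""
    total = mean_energy(levels1, beta1) + mean_energy(levels2, beta2)
    lo, hi = min(beta1, beta2), max(beta1, beta2)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        # U decreases with beta, so too much energy means beta is still too small
        if mean_energy(levels1, mid) + mean_energy(levels2, mid) > total:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Hypothetical level schemes in arbitrary energy units (assumptions, not from the text)
levels_a = [0.0, 1.0, 2.0, 3.0]
levels_b = [0.0, 0.5, 2.5]
beta_a, beta_b = 0.4, 1.5

beta = common_beta(levels_a, beta_a, levels_b, beta_b)
print("common beta:", beta)
print("energy lost by system a:",
      mean_energy(levels_a, beta_a) - mean_energy(levels_a, beta))
```

The common value falls between the two starting values, and the system that began with the lower $\beta$ is the one that gives up energy.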

Fig. 1. U, $\beta$ relations for systems 1 and 2 and for their union.
[$\beta^+$ is defined by $U_1(\beta_1) + U_2(\beta_2) = U^+(\beta^+)$.]

The next step is to see whether quantities corresponding to heat
and work can be picked out in the statistical theory. In thermodynamic
language, we already know that increase in internal energy
is equal to heat supplied less work done, that is,

$$ dU = dQ - dW $$

Again, from the statistical expression,

$$ dU = \sum_i \varepsilon_i\,dp_i + \sum_i p_i\,d\varepsilon_i $$

It would be very agreeable if we could identify these quantities just 
as they stand. This will now be justified.

To see which terms correspond to external work, we recall that
real processes approximate better to quasi-static processes the more 
slowly they proceed. Suppose that the system is manipulated by ap- 
plying an external force field which varies very slowly with time. 
It follows from quantum theory that a varying field can produce 
transitions between two levels only so far as its time dependence 
contains the frequency v, such that hv is equal to the difference in 
energy of the levels. Thus if the force varies slowly enough it will 
produce as few transitions as we please, so that its effect is entirely in 
changing the $\varepsilon_i$, without producing any change in the $p_i$.¹ Then
$\sum_i p_i\,d\varepsilon_i$ is the expectation value of the change in U produced by such
a force, and can be identified as the work done on the system, which
is $-dW$. In consequence, $\sum_i \varepsilon_i\,dp_i$ must be identified as dQ.

The quantity $\beta$ is already identified as an empirical temperature.
To complete the discussion we must find what function of $\beta$ serves as
an integrating factor for dQ, which will lead us to the absolute
temperature and the entropy.

This argument can be short-circuited by asking instead whether the 
information function H has a thermodynamic meaning. We have 

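$$ H = \sum_i p_i \ln p_i $$

so for any variation of the $p_i$

$$ dH = \sum_i \ln p_i\,dp_i + \sum_i dp_i $$

and since $p_i = Z^{-1} e^{-\beta \varepsilon_i}$,

$$ dH = -\beta \sum_i \varepsilon_i\,dp_i + \sum_i (1 - \ln Z)\,dp_i $$

Now $\sum_i p_i = 1$, so $\sum_i dp_i = 0$, and we have simply

$$ dH = -\beta\,dQ $$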

1 The critical reader will be worried at this stage for at least one good reason. 
An applied force which changes the $\varepsilon_i$ without altering the $p_i$ will in general
bring the system into a condition which is not one of thermal equilibrium.
Unless all the energy levels change by the same fraction, the $p_i$ cannot be
expressed in terms of the new $\varepsilon_i$ by a single new value of $\beta$. We can of course
save the argument in an elementary way by postulating that our ideal applied 
forces do change all energy levels in the same proportion — which is true if the 
frequencies of all oscillators in the system change in the same proportion: 
but we must recognize that in several experimental situations one produces 
different thermal regimes in different characteristic motions of the same system. 
Such situations are not in thermal equilibrium. See pp. 32 and 34.

Obviously H is a single-valued function of the condition of the
system, so $\beta$ is itself an integrating factor of dQ. Since we know that

$$ dS = T^{-1}\,dQ $$

any other integrating factor such as $\beta$ must have the form

$$ \beta = T^{-1} f(S) $$

We have already seen that $\beta$ is a measure of temperature, independent
of the nature, and in particular of the size, of the system,
whereas S is (like H) an extensive quantity, such that if we consider
two systems A and B conceptually united, the entropy of the joint
system is just $S_A + S_B$, the sum of the separate entropies. Thus $f(S)$
can only be a constant, whose numerical value depends simply on
our choice of a system of units, and it is conventional to write

$$ \beta = 1/kT $$

and

$$ S = -k \sum_i p_i \ln p_i $$

where k is called Boltzmann's constant.

In sum, then, there is a complete correspondence between the 
broadest description of a system of which we know nothing except 
the expectation value of its energy, and the thermodynamic descrip- 
tion of the system with the same value assigned to its internal 
energy. 

Again, substituting $p_i = e^{-\beta \varepsilon_i}/Z$ into the expression for S we
obtain

$$ S = -k \sum_i p_i \ln p_i = k[\beta U + \ln Z] = \frac{U}{T} + k \ln Z $$

or, the Helmholtz free energy F is given by

$$ F = U - TS = -kT \ln Z $$
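
These two relations can be checked directly for any finite set of levels. The sketch below is illustrative only: the level values and the temperature are arbitrary assumptions, and units are chosen so that k = 1.

```python
import math

def canonical(levels, T, k=1.0):
    """Return Z, U and S = -k * sum(p ln p) for the canonical distribution."""
    beta = 1.0 / (k * T)
    weights = [math.exp(-beta * e) for e in levels]
    Z = sum(weights)
    p = [w / Z for w in weights]
    U = sum(pi * e for pi, e in zip(p, levels))
    S = -k * sum(pi * math.log(pi) for pi in p)
    return Z, U, S

levels = [0.0, 0.3, 1.1, 2.0]   # hypothetical energy levels (assumption)
T = 0.7                          # temperature in units where k = 1

Z, U, S = canonical(levels, T)
F = U - T * S
print("S from probabilities:", S)
print("U/T + k ln Z        :", U / T + math.log(Z))
print("F = U - TS          :", F)
print("-kT ln Z            :", -T * math.log(Z))
```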

Fluctuations 

In the elementary thermodynamics of large systems, one normally
thinks of the internal energy as constant whether the system is in
contact with a heat bath (that is, a second system at the same
temperature, of large thermal capacity) or strictly isolated. Observation
of a small system such as a Brownian particle, or the moving
part of a sensitive galvanometer, shows that in thermal equilibrium 
such a system is continually fluctuating in energy as it gains or loses 
it in interaction with its surroundings. The statistical description we 
have chosen to employ is strictly suitable to describe a system inter- 
acting with a heat bath, and so at a constant temperature and not at a 
constant energy. For large systems, the fluctuation of energy is 
usually so small compared to the mean energy that the distinction 
need not be drawn: but for a system consisting of a liquid in equi- 
librium with its vapour at the saturation vapour pressure, to take one 
example, large fluctuations can and will occur. That particular 
situation is not covered by the model we have used so far, which 
corresponds to a system at constant volume. Since the whole thermo- 
dynamics of a system is implied by its thermodynamics at constant 
volume, so long as that volume appears explicitly in the thermo- 
dynamic functions, this limitation is not one of principle. We shall 
later introduce a powerful method which covers such more general 
situations with no increase of effort. It must however be emphasized 
that all of thermodynamics is implied by the results we already have. 
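
To give a feeling for how small the fluctuation becomes in a large system, here is a rough sketch. It rests on assumptions that are not part of the argument above: the "large system" is taken to be a collection of independent two-level systems (levels 0 and eps), so that mean energies and variances simply add.

```python
import math

def relative_fluctuation(n_systems, eps, kT):
    """Root-mean-square energy fluctuation divided by mean energy.

    Assumes n_systems independent two-level systems with levels 0 and eps,
    each described by the canonical distribution at temperature kT.
    """
    p1 = 1.0 / (math.exp(eps / kT) + 1.0)   # probability of the upper level
    p0 = 1.0 - p1
    mean = n_systems * eps * p1
    variance = n_systems * eps ** 2 * p0 * p1  # variances of independent parts add
    return math.sqrt(variance) / mean

for n in (1, 100, 10 ** 6):
    print(n, relative_fluctuation(n, 1.0, 1.0))
```

The ratio falls off as the inverse square root of the number of constituents, which is why the distinction between constant energy and constant temperature can usually be ignored for large systems.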

Conclusion 

The statistical description of a system by the set of probabilities 

$$ p_i = Z^{-1} e^{-\varepsilon_i/kT} $$

where

$$ Z = \sum_i e^{-\varepsilon_i/kT} $$

corresponds exactly to the canonical distribution of Willard Gibbs. 
If one is only concerned with equilibrium thermodynamics, the 
statistical mechanics can perfectly well be founded by postulating 
this distribution law. The introductory argument which we have 
worked through has two merits. First, it lends plausibility to what at 
first appears a highly arbitrary distribution, though this plausibility 
can be established by arguments from kinetic theory, and was histori- 
cally so established; secondly, it gives a guide on how to proceed 
when discussing situations, other than those of thermal equilibrium, 
where irreversible processes are advancing at a finite rate. The prob- 
lem of the rates of real processes, however, lies beyond the scope of 
this book. 



THE STATISTICS OF 
SOME SIMPLE SYSTEMS 



In this chapter we shall consider the application of the canonical 
distribution to find the thermodynamical behaviour of some very 
simple physical systems whose energy levels are easily identified. 
This will help to emphasize that the statistical method is applicable 
to all physical systems, regardless of size, which are in a condition 
correctly described by specifying a temperature. It is true that 
statistical mechanics was originally developed as a theory of large and 
complicated systems, but this restriction is no longer necessary. 

The Fermi oscillator 

This simplest of all ideal physical systems possesses only two distinct
states, with energies 0 and $\varepsilon$, say. The partition function is

$$ Z = 1 + e^{-\varepsilon/kT} $$

and the probability of the occupation of the ground state is

$$ p_0 = \frac{1}{1 + e^{-\varepsilon/kT}} $$

and of the excited state is

$$ p_1 = \frac{e^{-\varepsilon/kT}}{1 + e^{-\varepsilon/kT}} = \frac{1}{e^{\varepsilon/kT} + 1} $$

so that

$$ U = \varepsilon p_1 = \frac{\varepsilon}{e^{\varepsilon/kT} + 1} $$

The real system most directly analogous to the Fermi oscillator
is a particle of spin 1/2 (i.e. with intrinsic angular momentum $\hbar/2$) and
magnetic moment $\mu$, in a magnetic field H. Such particles include
electrons, protons and various atomic nuclei. In the presence of the
field, the eigenstates of energy of this particle are the states in which
the angular momentum, and with it the magnetic moment, is either
parallel or anti-parallel to the field, giving values of the magnetic
energy $-\mu H$ and $+\mu H$ relative to the cross orientation. With this
symmetric disposition of the energy values we get a slightly different
appearance of the formulae.

$$ Z = e^{\mu H/kT} + e^{-\mu H/kT} = 2\cosh(\mu H/kT) $$

$$ p_0 = e^{\mu H/kT}/Z = (1 + e^{-2\mu H/kT})^{-1} $$

$$ p_1 = e^{-\mu H/kT}/Z = (1 + e^{2\mu H/kT})^{-1} $$

and

$$ U = -\mu H p_0 + \mu H p_1 = -\mu H\,\frac{e^{\mu H/kT} - e^{-\mu H/kT}}{e^{\mu H/kT} + e^{-\mu H/kT}} = -\mu H \tanh(\mu H/kT) $$

The expectation value of the magnetization in the direction of the
field is

$$ m = \mu p_0 - \mu p_1 = \mu \tanh(\mu H/kT) $$

If $\mu H \ll kT$, then

$$ m = \mu^2 H/kT = \chi_a H $$

where $\chi_a$ is the susceptibility per particle, so

$$ \chi_a = \mu^2/kT $$

Again, the free energy per particle is

$$ F = -kT \ln Z = -kT \ln 2 - kT \ln\cosh(\mu H/kT) $$

so that the change in free energy associated with the field is

$$ \Delta F = -kT \ln\cosh(\mu H/kT) $$

and for the usual situation with $\mu H \ll kT$,

$$ \cosh(\mu H/kT) \approx 1 + \tfrac{1}{2}(\mu H/kT)^2 $$

so

$$ \Delta F \approx -\frac{1}{2}\,\frac{\mu^2 H^2}{kT} $$

The $-kT \ln 2$ in F deserves note. It arises from a permanent entropy
of $k \ln 2$ due to the presence of two equally probable states
when there is no field, that is to say, from a double degeneracy of the
ground state in the absence of a field. We note also that $\Delta F = -\tfrac{1}{2}Hm$
is just the usual estimate in magnetic theory for the energy associated
with the induced magnetization m.
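
The weak-field statements can be confirmed numerically. What follows is a minimal sketch under stated assumptions: the function names are inventions for the purpose, and the values of the moment, field and temperature are arbitrary, in units where k = 1.

```python
import math

def spin_half(mu, H, kT):
    """Z, level probabilities, energy and magnetization of one spin-1/2 moment."""
    x = mu * H / kT
    Z = 2.0 * math.cosh(x)
    p0 = math.exp(x) / Z      # moment parallel to the field, energy -mu*H
    p1 = math.exp(-x) / Z     # moment anti-parallel, energy +mu*H
    U = -mu * H * p0 + mu * H * p1
    m = mu * p0 - mu * p1
    return Z, p0, p1, U, m

def delta_F(mu, H, kT):
    """Change in free energy due to the field: -kT ln cosh(mu H / kT)."""
    return -kT * math.log(math.cosh(mu * H / kT))

mu, H, kT = 1.0, 0.01, 1.0    # arbitrary illustrative values with mu*H << kT
Z, p0, p1, U, m = spin_half(mu, H, kT)
print("m             :", m)
print("mu^2 H / kT   :", mu ** 2 * H / kT)
print("delta F       :", delta_F(mu, H, kT))
print("-(1/2) H m    :", -0.5 * H * m)
```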

A different use for the idea of the Fermi oscillator will arise when 
we are considering systems of many particles, so need only be noted 
briefly at this point. Suppose that we are concerned with a system 
containing a number of similar particles which obey Pauli's exclusion 
principle, and that we have a set of suitable independent states to 
describe the motion of the single particles. Then a state of the whole 
system is completely defined when we state which of the one-particle 
states are occupied. Provided we can associate a definite energy 
with each one-particle state (this takes a good deal of arranging for 
strongly interacting particles), then since any such state is either un- 
occupied, or occupied by at most one particle, it can be regarded as 
having a Fermi oscillator associated with its state of occupation. 
We shall make more use of this idea later. 

The harmonic (Bose) oscillator 

We take from quantum mechanics the result that the energy levels
of a harmonic oscillator, measured from the value of the potential
energy at the centre, are at $(n+\tfrac{1}{2})h\nu$, where n is any positive integer or
zero, and $\nu$ is the frequency of the oscillator. For an oscillator in one
dimension, or with one degree of freedom, the levels are single, that
is, with only one distinct state at each energy level, so the partition
function is

$$ Z = \sum_{n=0}^{\infty} e^{-(n+\frac{1}{2})h\nu/kT} = \frac{e^{-\frac{1}{2}h\nu/kT}}{1 - e^{-h\nu/kT}} $$

The probability of the nth excited state is

$$ p_n = e^{-(n+\frac{1}{2})h\nu/kT}/Z = e^{-nh\nu/kT}\,(1 - e^{-h\nu/kT}) $$

Thus

$$ U = \sum_{n=0}^{\infty} p_n\,(n+\tfrac{1}{2})h\nu $$

and it is a simple exercise to reduce this to

$$ U = \frac{h\nu}{2} + \frac{h\nu}{e^{h\nu/kT} - 1} $$

if one remembers that

$$ \sum_{n=0}^{\infty} n x^n = \frac{x}{(1-x)^2} $$

The easiest way to obtain the entropy is not by hammering out
$\sum p \ln p$ but by observing that since

$$ U - TS = F = -kT \ln Z, $$

$$ S = k \ln Z + U/T. $$

These expressions can be simplified at extreme temperatures. In
particular, if $h\nu \ll kT$, we can expand the exponential in the denominator
in U and find $U \approx kT$ + powers of $h\nu/kT$ (the temperature-independent
term in fact cancels out exactly); while, if $h\nu \gg kT$, only
the first excited state contributes appreciably, and we have

$$ U \approx \tfrac{1}{2}h\nu + h\nu\,e^{-h\nu/kT} $$

Thus at high temperatures, the specific heat C, which is dU/dT, has
the constant value k given by the empirical rule of equipartition of
energy, whereas at low temperatures the specific heat drops exponentially
with 1/T. The exponential of course goes to zero so fast,
as T decreases, that it kills the infinity in C which would otherwise
arise (from the $T^{-2}$ factor which appears on taking the derivative).
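
The "simple exercise" and the two limits can be checked by brute force. The sketch below is illustrative only: the truncation of the level sum at n_max terms is an assumption made for computation, and the heat capacity is obtained by differentiating the closed form for U, which gives $C/k = x^2 e^x/(e^x - 1)^2$ with $x = h\nu/kT$.

```python
import math

def oscillator_sum(h_nu, kT, n_max=2000):
    """U from the level sum, truncated at n_max terms (adequate unless h_nu << kT/100)."""
    # the common factor exp(-h_nu / 2kT) cancels between numerator and Z
    weights = [math.exp(-n * h_nu / kT) for n in range(n_max)]
    Z = sum(weights)
    return sum((n + 0.5) * h_nu * w for n, w in enumerate(weights)) / Z

def oscillator_closed(h_nu, kT):
    """U = h_nu/2 + h_nu / (exp(h_nu/kT) - 1)."""
    return 0.5 * h_nu + h_nu / math.expm1(h_nu / kT)

def heat_capacity(h_nu, kT):
    """C in units of k, from differentiating the closed form for U."""
    x = h_nu / kT
    return x * x * math.exp(x) / math.expm1(x) ** 2

print(oscillator_sum(1.0, 1.0), oscillator_closed(1.0, 1.0))
print("C/k at high T:", heat_capacity(1.0, 100.0))
print("C/k at low T :", heat_capacity(1.0, 0.05))
```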

Again, these results turn out to have a direct application to the 
many-body problem. If the particles constituting a many-body 
system have zero or integral spin, they are of Bose type and any 
number of them may appear in the same one-particle state. The 
energy spectrum associated with that state has thus the same intervals 
as an oscillator spectrum. (According to convenience, one either
uses the same expressions as we have just derived, or cuts out the
factor $e^{-h\nu/2kT}$ in Z and the added $h\nu/2$ in U.)

The rotor with two degrees of freedom 

The rotation of, for example, a diatomic molecule consisting of two 
different atoms, has a fairly simple description in quantum theory. 
In classical mechanics, such a pair of point particles has an angular 
position fixed by a polar angle and an azimuth angle, and in free 
rotation a corresponding pair of angular momenta are constant. In
quantum mechanics, the total angular momentum is fixed by a quantum
number j, and the motion is determined by j, together with the
angular momentum about a specified axis. These are connected by
the fact that for given j, the maximum value which the component
angular momentum can assume is $j\hbar$.

Since, however, the component angular momenta about different
axes are not mutually compatible observables, the transverse angular
momenta have zero expectation value but non-zero dispersion in
such a state, so that the expectation value of the square of the total
angular momentum works out at $j(j+1)\hbar^2$ instead of the $j^2\hbar^2$ one
might have expected. The corresponding energy of rotation is then

$$ \varepsilon_j = \frac{j(j+1)\hbar^2}{2A} $$

where A, the moment of inertia of the molecule about its mass centre,
is equal to $m_1 m_2 r_0^2/(m_1 + m_2)$ for atoms of masses $m_1$, $m_2$ at separation
$r_0$.

The number of states with this energy is the number of distinct
values of the component angular momentum from $j\hbar$ to $-j\hbar$ at
intervals of $\hbar$, which is $2j+1$. Thus the energy levels of the rotor are

$$ \varepsilon_j = \frac{\hbar^2}{2A}\,j(j+1) $$

with degeneracy $2j+1$. Then

$$ Z = \sum_{j=0}^{\infty} (2j+1)\,e^{-j(j+1)\theta/T} $$

where we have introduced for convenience a characteristic temperature
$\theta = \hbar^2/2Ak$. Setting further $\lambda = j + \tfrac{1}{2}$ we have

$$ Z = \sum_{\lambda=1/2}^{\infty} 2\lambda\,e^{-(\lambda^2 - \frac{1}{4})\theta/T} $$

Again, this expression is tractable at extreme temperatures. At very
high temperature, that is, for $T \gg \theta$, the series contains many terms
which contribute appreciably to its sum, and is quite accurately
equivalent to the integral

$$ Z = \int_0^{\infty} 2\lambda\,e^{-\lambda^2\theta/T}\,d\lambda = \frac{T}{\theta} $$

Then U = kT: equipartition, as for the oscillator.

At low temperature only the first few terms of the various series
contribute. Thus we have

$$ Z = 1 + 3e^{-2\theta/T} + 5e^{-6\theta/T} + \cdots $$

$$ U = k\theta\,\frac{6e^{-2\theta/T} + 30e^{-6\theta/T} + \cdots}{1 + 3e^{-2\theta/T} + \cdots} = k\theta\,(6e^{-2\theta/T} - 18e^{-4\theta/T}) $$

and

$$ C = k\left(\frac{\theta}{T}\right)^2 (12e^{-2\theta/T} - 72e^{-4\theta/T}) $$

where U is most easily obtained by direct construction from the $p_j$
and C is again $\partial U/\partial T$.

If our molecule consists of two atoms with identical nuclei there 
is a complication, since some states of rotation of the molecule may 
not occur. This is considered later on. 
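
Both limits of the rotor sum are easy to test by direct summation. This is a sketch under stated assumptions: temperature is measured in units of $\theta$ and energy in units of $k\theta$, and the sum is cut off at j_max terms, a number chosen here simply to be large enough for the temperatures tried.

```python
import math

def rotor_Z_U(t, j_max=400):
    """Direct sums for the rotor, with t = T/theta. Returns Z and U/(k*theta)."""
    terms = [(2 * j + 1) * math.exp(-j * (j + 1) / t) for j in range(j_max)]
    Z = sum(terms)
    U = sum(j * (j + 1) * w for j, w in enumerate(terms)) / Z
    return Z, U

def rotor_U_low_T(t):
    """Two-term low-temperature estimate, U/(k*theta) = 6e^(-2/t) - 18e^(-4/t)."""
    return 6.0 * math.exp(-2.0 / t) - 18.0 * math.exp(-4.0 / t)

Z_hot, U_hot = rotor_Z_U(100.0)
print("high T: Z =", Z_hot, " U/k*theta =", U_hot, " (compare T/theta = 100)")

Z_cold, U_cold = rotor_Z_U(0.2)
print("low T : U/k*theta =", U_cold, " estimate =", rotor_U_low_T(0.2))
```

At T = 100θ the sum exceeds T/θ by about one third, which is the first correction to replacing the sum by the integral.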

An ideal rubber 

Since we have been concerned so far with sums over energy levels, 
it is amusing to digress on a system whose energy levels are all zero 
and whose behaviour is determined wholly by its entropy. The 
familiar fact that rubber under given tension shrinks when warmed 
indicates that a good deal of the elastic free energy stored in the 
stretched rubber is associated with an entropy term. The entropy is 
known to arise from the vast number of different configurations of 
nearly equal energy into which the long chain molecules of the rubber 
can coil themselves. 

To obtain a simple calculation, we idealize ruthlessly from three 
dimensions to one, and from real atoms occupying space to atoms 
which are merely the pivot points of rigid links. A rubber molecule is 
viewed as a chain, rather like a surveyor's chain, laid along a straight 
line of length La with a certain number of segments turned back- 
wards. Suppose there are N links, of length a, and let n be the 
number of those arrangements of the links which give the same total 
length La. Then 

$$ Z = n $$

$$ F = -kT \ln n $$

$$ U = 0 $$

$$ S = k \ln n $$

To evaluate n, note that the total number of links pointing forward
is $\tfrac{1}{2}(N+L)$ and the total number of backward links is $\tfrac{1}{2}(N-L)$.
The corresponding number of distinct sequences of forward and
backward links is found by considering the links as a set of distinct
objects, and evaluating the number of ways of choosing $\tfrac{1}{2}(N+L)$ out
of N. Thus

$$ n = \frac{N!}{[(N+L)/2]!\,[(N-L)/2]!} $$

To find its logarithm we apply Stirling's theorem, which gives the
first terms of the asymptotic expansion

$$ \ln(N!) = (N + \tfrac{1}{2})\ln N - N + \tfrac{1}{2}\ln(2\pi) + O(N^{-1}) $$

If N and $\tfrac{1}{2}(N-L)$ are both large, we may neglect the terms of lower
order than N, so

$$ S/k = \ln n \approx N \ln N - \frac{N+L}{2}\ln\frac{N+L}{2} - \frac{N-L}{2}\ln\frac{N-L}{2} $$

$$ = \frac{N+L}{2}\ln\frac{2N}{N+L} + \frac{N-L}{2}\ln\frac{2N}{N-L} $$

$$ = -N[x \ln x + (1-x)\ln(1-x)] $$

where $x = (N+L)/2N$ and

$$ F = -TS = NkT[x \ln x + (1-x)\ln(1-x)] $$

The tension P in the chain is then the derivative of free energy with
respect to length, so

$$ P = \frac{\partial F}{\partial (La)} = -\frac{T}{a}\frac{\partial S}{\partial L} $$

$$ = \frac{NkT}{a}\,\frac{\partial x}{\partial L}\,\frac{d}{dx}[x \ln x + (1-x)\ln(1-x)] $$

$$ = \frac{kT}{2a}\ln\frac{x}{1-x} = \frac{kT}{2a}\ln\frac{N+L}{N-L} $$

If now we consider $L \ll N$, we have

$$ \ln\frac{N+L}{N-L} = \frac{2L}{N} + O\left((L/N)^3\right) $$

so

$$ P = \frac{kT}{a}\,\frac{L}{N} $$

giving elastic behaviour at constant T, with elastic constant $kT/Na^2$
(since length = La), proportional to T and inversely to N.
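
One can test how much is lost by the Stirling approximation by working from the exact count of arrangements. The sketch below is an illustration under stated assumptions: the exact logarithm is taken from the log-gamma function, the derivative with respect to L is replaced by a central difference over L ± 2 (L changes in steps of 2 when one link is reversed), and kT and a are set to 1.

```python
import math

def ln_arrangements(N, L):
    """Exact ln n for N links with (N+L)/2 pointing forward, via log-gamma."""
    return (math.lgamma(N + 1)
            - math.lgamma((N + L) / 2 + 1)
            - math.lgamma((N - L) / 2 + 1))

def tension_exact(N, L, kT=1.0, a=1.0):
    """P = -(T/a) dS/dL with S = k ln n, using a central difference in L."""
    dlnn_dL = (ln_arrangements(N, L + 2) - ln_arrangements(N, L - 2)) / 4.0
    return -kT * dlnn_dL / a

def tension_stirling(N, L, kT=1.0, a=1.0):
    """P = (kT / 2a) ln[(N+L)/(N-L)], the result of the Stirling approximation."""
    return kT / (2.0 * a) * math.log((N + L) / (N - L))

N = 10000
for L in (100, 2000, 6000):
    print(L, tension_exact(N, L), tension_stirling(N, L), L / N)
```

For small L/N all three columns agree; at larger extensions the logarithmic formula stiffens relative to the linear law, while still following the exact count closely.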

Considering the brutality of the model, this is not too unlike the 
behaviour of actual rubbers. It may seem odd that we get a tension 
out of a system which has no mechanical properties, but this is less 
surprising if we think of the kinetics of the model in more than the 
one dimension. The essential property of the chain is that the links 
themselves are inextensible, but this means that the molecules of the 
heat bath, knocking the links sideways in the exchange of thermal 
energy, produce lengthways tension in the chain. This process is 
much the same as the increase in the tension of a clothes-rope at 
every blow of a carpet-beater. The great power of statistical thermo- 
dynamics is that we can find the average tension (but not its rate of 
fluctuation) without the long and difficult consideration of a detailed 
kinetic model. 

One particle in a box 

The real walls confining the particle are replaced by an equivalent 
potential field. No generality is lost, so long as we consider volume 
rather than surface effects, if we take the box as a cubical region of 
edge a in which the potential energy is constant, and can be taken 
to be zero. At the boundary faces of the cube the potential rises to an 
indefinitely large value, so that the wave function of the particle 
vanishes at and outside these boundaries: this corresponds to the 
fact of the particle's imprisonment. 

The standing wave pattern formed inside the box is then such that a 
whole number of half wavelengths occupies the distance a between 
each pair of opposite faces. This number need not be the same in 
the three different directions. Taking axes of x, y, z along the cube 
edges we have then wavelengths

$$ \lambda_x = 2a/n_x $$

and corresponding momenta

$$ p_x = n_x h/2a $$

so that the squared total momentum has value

$$ p^2 = \frac{h^2}{4a^2}(n_x^2 + n_y^2 + n_z^2) $$

The distinct states can thus be represented as points in the space
whose co-ordinates are $p_x$, $p_y$ and $p_z$, with positions given by positive
integer multiples of h/2a. Thus the volume of momentum space
'effectively occupied' by each state is $(h/2a)^3$, and in terms of the
volume of configuration space V ($= a^3$) available to the particle this
is $h^3/8V$. In other words, the density of states in momentum space is
$8V/h^3$, and, since only one octant of this space is being used, the
number of states in the shell between p and p+dp is on the average

$$ \frac{8V}{h^3}\cdot\frac{4\pi p^2}{8}\,dp = \frac{4\pi V}{h^3}\,p^2\,dp $$

It can be shown (with some difficulty) that this result holds for 
volumes of other shapes, the first correction term corresponding to a 
surface energy. 

The kinetic energy of the particle is just $p^2/2m$, so the ground state
has energy

$$ \frac{3h^2}{8ma^2} $$

and the three next states have energy

$$ \frac{3h^2}{4ma^2} $$

If then V or T is so large that

$$ h^2/(mV^{2/3}) \ll kT $$

as will generally be the case when this model is of some actual use,
the sum

$$ Z = \sum_{n_x, n_y, n_z} \exp\left[-\frac{h^2}{8ma^2kT}(n_x^2 + n_y^2 + n_z^2)\right] $$

can be replaced by the integral

$$ Z = \frac{4\pi V}{h^3}\int_0^{\infty} e^{-p^2/2mkT}\,p^2\,dp $$

which is brought by a change of variable to standard form

$$ Z = \frac{4\pi V}{h^3}(2mkT)^{3/2}\int_0^{\infty} x^2 e^{-x^2}\,dx $$

and hence (Reader: check this!)

$$ U = \tfrac{3}{2}kT $$

$$ Z = V\left(\frac{2\pi mkT}{h^2}\right)^{3/2} $$

We have worked out this bit of wave mechanics in detail because
of the great importance of the 'particle in a box' as an element in 
constructing models of more complicated systems. Obviously its 
most direct application will be to the thermodynamics of the ideal 
gas, but before discussing that we must look at a problem whose 
solution is at once constructed from results we have already obtained. 
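
The replacement of the sum by the integral can be tried numerically. The following is a rough sketch under stated assumptions: the single dimensionless parameter r stands for $h^2/8ma^2kT$, the three-dimensional sum is written as the cube of a one-dimensional sum, and each sum is cut off at n_max terms.

```python
import math

def box_Z_sum(r, n_max=2000):
    """Z for a cubical box by direct summation, with r = h^2 / (8 m a^2 kT)."""
    one_dimension = sum(math.exp(-r * n * n) for n in range(1, n_max + 1))
    return one_dimension ** 3

def box_Z_integral(r):
    """V (2 pi m k T / h^2)^(3/2) written in terms of r, i.e. (pi / 4r)^(3/2)."""
    return (math.pi / (4.0 * r)) ** 1.5

def box_U_over_kT(r, n_max=2000):
    """U/kT by direct summation: three times the one-dimensional mean of r*n^2."""
    weights = [math.exp(-r * n * n) for n in range(1, n_max + 1)]
    mean = sum(r * n * n * w for n, w in zip(range(1, n_max + 1), weights)) / sum(weights)
    return 3.0 * mean

r = 1e-4   # illustrative value with level spacing much less than kT
print("sum / integral:", box_Z_sum(r) / box_Z_integral(r))
print("U / kT        :", box_U_over_kT(r))
```

The sum falls a little short of the integral, and U/kT comes out slightly above 3/2; the discrepancy shrinks as r decreases, in keeping with its being a surface rather than a volume effect.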

One heteronuclear diatomic molecule in a box 
Evidently the configuration of the molecule is completely determined 
if we know the position of its centre of mass, the direction of its axis, 
and the separation of the atomic nuclei, provided that the electron 
system which holds it together remains in its ground state. If this is 
so we can ignore the detailed variation of the electron charge dis- 
tribution during the movement of the molecule, and regard it simply 
as a background giving rise to an empirical force between the nuclei. 
This neglect is justified for most molecules at moderate temperatures. 

The motion of the mass centre is independent of the other motions 
of the molecule; rotation and expansion are not quite independent, 
as is easily seen since angular momentum is conserved in the free 
motion of the molecule, so that angular speed must drop if moment 
of inertia increases. Moment of inertia is proportional to the square 
of the internuclear distance, so expansion or contraction of the 
molecule at constant angular momentum results in a changed 
angular speed and energy of rotation. Thus energy is exchanged 
between the rotary and vibratory motion of the molecule. 

However, if the chemical binding force due to the electrons varies 
sharply enough with distance, the vibration of the nuclei along the 
axis of the molecule has high frequency and small amplitude, and the 
rotation and vibration can be treated as independent without serious 
error. Indeed, if the temperature is high enough for this to be un- 
justified, there is a substantial chance of the molecule being torn 
apart by the centrifugal force of its own rotation, and it becomes 
necessary to treat the dissociation equilibrium, which we shall come 
to later. 

For moderate temperatures, then, we construct the partition function
as a sum over states of energy $\varepsilon$, where distinct states are denoted
by quantum numbers $n_{tr}$, j, v referring to the translation, rotation
and vibration of the molecule, and we have

$$ \varepsilon = \varepsilon_{tr}(n) + \varepsilon_r(j) + \varepsilon_v(v) $$

$$ = \frac{h^2}{8ma^2}(n_x^2 + n_y^2 + n_z^2) + \frac{\hbar^2}{2A}\,j(j+1) + (v + \tfrac{1}{2})h\nu $$

where m is the total mass and A the moment of inertia of the molecule.
The constituent parts of $\varepsilon$ are sufficiently distinguished without
the subscripts, so we drop those and write

$$ Z = \sum e^{-(1/kT)[\varepsilon(n) + \varepsilon(j) + \varepsilon(v)]} $$

$$ = \left(\sum e^{-\varepsilon(n)/kT}\right)\left(\sum e^{-\varepsilon(j)/kT}\right)\left(\sum e^{-\varepsilon(v)/kT}\right) = Z_{tr}\,Z_r\,Z_v $$

where $Z_{tr}$, $Z_r$, $Z_v$ are the partition functions we have already calculated
for the particle in a box, the rotor, and the harmonic oscillator.

Note that a factorization of the partition function in this way will 
always be possible when the energy of the system is a sum of quite 
independent terms. This is of course formally exactly the same posi- 
tion as we met in making the conceptual union of two physically 
distinct systems. Conversely, in the approximation we are using, 
the translation, rotation and vibration of the same molecule may be 
regarded as distinct physical systems, which might even be at dif- 
ferent temperatures. 
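
The factorization is a matter of pure algebra and can be demonstrated on any small example. In the sketch below the three lists of levels are hypothetical stand-ins for the translational, rotational and vibrational energies (they are not real molecular data), and a degenerate level is simply listed once for each of its states.

```python
import itertools
import math

def partition(levels, beta):
    """Z = sum over states of exp(-beta * energy)."""
    return sum(math.exp(-beta * e) for e in levels)

# Hypothetical level lists, arbitrary units (assumptions for illustration)
translation = [0.0, 0.7, 1.9]
rotation = [0.0, 0.4, 0.4, 0.4]    # a threefold degenerate level listed three times
vibration = [0.5, 1.5, 2.5]
beta = 1.3

# Every state of the whole system is a triple of states of the parts
joint = [a + b + c for a, b, c in itertools.product(translation, rotation, vibration)]

z_joint = partition(joint, beta)
z_product = (partition(translation, beta)
             * partition(rotation, beta)
             * partition(vibration, beta))
print(z_joint, z_product)
```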

Such physical situations actually arise. Suppose for example that 
a gas which is in full thermal equilibrium is rapidly compressed. 
The translation of the molecules has no direct coupling to the rota- 
tion and vibration, so the increased kinetic energy of translation 
which arises from the adiabatic compression may leak relatively 
slowly, by molecular collisions, into these other modes of motion. 
The transfer of heat dQ from a system at $T_1$ to one at a lower temperature
$T_2$ involves an increase of entropy $dQ\,(T_2^{-1} - T_1^{-1})$, so this
dissipative process contributes to the damping of sound waves in
gases with polyatomic molecules.

There is, as we have already seen, an intrinsic coupling between 
rotation and vibration. This is so constrained by conservation of
angular momentum that the leakage of energy from rotation to 
vibration is also controlled by inter-molecular collisions. 

The practical importance of the simple result obtained in this sec- 
tion is that the moment of inertia (or moments, for a non-linear 
molecule) and the frequencies of internal vibration of a molecule can 
be obtained by spectroscopic observation of the vapour. Such 
observations may be easier and are often much more precise than 
direct measurements of thermal properties over a wide temperature 
range, so that thermodynamic tables computed, using statistical 
mechanics, from spectroscopic data are often employed instead of
direct observation. The detailed calculation is often rather more
complicated, but the ideas are those that we have just discussed. 

Negative absolute temperatures 

At this stage we can deal with a topic which was hinted at in one of 
the problems of the introduction; namely, the existence of situations 
which are described by a negative absolute temperature. In such a 
situation the population of some group of energy states must increase
with increasing energy according to the expression $\exp(-\epsilon/kT)$,
which is now $\exp(\epsilon/k|T|)$. For this to make sense, the group of
states must have an upper bound in energy. 

Such an upper bound is not known to exist for any complete 
physical system, but it may well exist for the energy associated with a 
particular set of degrees of freedom. For example, in a non-para- 
magnetic substance containing hydrogen, the spins of the hydrogen 
protons are very weakly coupled to the other modes of motion of the 
system, particularly in the cold solid state. The spin motions then 
constitute a sub-system, and since the proton spin angular momentum
is $\hbar/2$, the spin system is equivalent to a set of just such Fermi
oscillators as were discussed above. 

If for simplicity we neglect the magnetic action of the spinning 
protons on each other, only one typical proton need be considered, 
and the argument given before will apply. In the condition with the 
highest possible internal energy, the upper level is occupied with 
probability 1, and the lower with probability 0. This corresponds to
the limit $T = -\delta$ as $\delta$ tends to zero. Both levels are equally occupied
at both $T = +\infty$ and $T = -\infty$. The scale of rising energy runs from
$T = +0$ to $T = +\infty$, which is the same as $T = -\infty$, and goes on
rising from $T = -\infty$ to $T = -0$. Over the same range, the function
$-T^{-1}$ runs continuously from $-\infty$ through zero to $+\infty$. Note that all
negative temperatures are hotter than all positive temperatures. 
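
The ordering of temperatures along the scale of rising energy can be checked numerically. What follows is a minimal Python sketch, not part of the original text. It assumes a two-level system whose lower level has energy zero and whose upper level has energy ε, in units where ε = k = 1; the function name is an illustrative choice.

```python
import math


def upper_level_probability(eps_over_kT: float) -> float:
    """Probability of the upper level of a two-level system.

    eps_over_kT is (level spacing)/(kT); it is negative when T is negative.
    """
    return 1.0 / (1.0 + math.exp(eps_over_kT))


# Walk along the scale of rising energy, parametrized by -1/T
# (units in which the level spacing and k are both 1).
for minus_inverse_T in (-50.0, -1.0, 0.0, 1.0, 50.0):
    p_upper = upper_level_probability(-minus_inverse_T)
    print(f"-1/T = {minus_inverse_T:+6.1f}   p(upper) = {p_upper:.6f}")
```

The occupation of the upper level rises monotonically with $-1/T$, passing through one half where $T = \pm\infty$, which is why $-1/T$ is the natural variable on which to order 'hotness'.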

At a finite negative temperature the equilibrium magnetization per 
proton in a field H is again 

$$m = \mu\tanh(\mu H/kT) = -\mu\tanh(\mu H/k|T|)$$

i.e. is opposed to H. It is true in general that when H is applied to a
negative temperature magnetic system, instead of the field doing
work on the system, energy is fed into the field. This will give rise to
an actual power amplification in a suitable (i.e. resonant, in the
sense that $h \times \text{frequency} = 2\mu H$) alternating field.
This unlikely sounding effect does occur and is the basis of masers
and lasers. The practical problem is to find a moderately efficient
way of producing the negative temperature state. This can be done 
for the protons by using a suitable pulse of coherent resonance radia- 
tion, which exactly inverts a spin distribution, equilibrated at 
ordinary temperature T, to give the distribution characteristic of 
$-T$. In other systems, e.g. the ruby laser, an intense flash of
incoherent radiation (itself essentially at $T = \infty$) may suffice, through
the population of high metastable levels surviving the pulse, while 
intermediate unstable levels depopulate rapidly towards the ground 
state. 

In this last case, the notion of negative temperature can be applied 
only to the population ratio of a given pair of levels, and the whole 
distribution is not described by a temperature at all. Again, in an 
electric discharge, the electrons may be described by an electron
temperature around $+10^4$ K, the gas atoms by a temperature around
$+10^3$ K, and some particular pair of atomic or molecular excitations
by a 'temperature' of $-10^2$ K.

Negative temperatures are easily understood and treated in the 
context of statistical mechanics. It is interesting to note that if the 
lowest and highest energy states of a system are both non-degenerate 
the entropy vanishes on both sides of absolute zero, that is, for the 
coldest and hottest conceivable states of the system. In general, the 
entropy need not be the same at +0 and —0 absolute. 

Since, as noted above, all negative temperatures are hotter than all 
positive temperatures, special care must now be taken with the word- 
ing of the second law of thermodynamics. The standard form which 
survives is that due to Clausius — that heat cannot be transferred by a 
cyclic process from a cooler to a hotter body without other changes 
being simultaneously produced. The Kelvin form — that an amount 
of heat cannot be converted completely into work by a cyclic pro- 
cess without other changes being simultaneously produced — fails 
because a reservoir at negative temperature rises in entropy as it 
gives up energy, dropping towards negative infinite temperature. 
The principle must be qualified by saying 'heat from a reservoir at 
positive temperature' and an inverse impossibility stated, that of 
converting an amount of work into heat supplied to an object at 
negative temperature by a cyclic process with no other effects. 

The ideal insulating crystal 

In a crystal which is a good insulator the whole electron structure 
remains in its ground state practically all the time, except while at 
very high temperature, since the least energy required to excite an
electron is several electron volts. Thus, to understand the thermodynamic
behaviour at moderate temperatures we need only consider
the movements of the atomic nuclei in the crystal. These can be con- 
sidered as point particles for the present purpose. 

Since the crystal as a whole obeys Hooke's law for small strains, 
the restoring force on an atom must be a linear function of its dis- 
placement with respect to its neighbours, so long as this displace- 
ment is a small fraction of the interatomic distance. Thus for a 
crystal containing N atoms we shall obtain 3N linear equations of
motion in the absence of external forces. 

These 3N equations can be imagined combined to give three equa- 
tions expressing the constancy of linear momentum of the whole 
system, three equations expressing the constancy of angular momentum
of the whole system and $3N - 6$ equations of harmonic
oscillation describing the oscillatory normal modes of the crystal. 
Alternatively, we may change the physical picture by supposing the 
crystal constrained, for example, by spring plates pressing against its 
faces. The motions of translation and rotation of the whole crystal 
will then be oscillatory, and the equations of motion of the surface 
atoms will have additional terms added in, corresponding to the 
pressure of the plates against them. However, for a crystal contain- 
ing many milliards of atoms, these changes produce very little 
change in the distribution of the normal oscillations, most of which 
will be shifted in frequency by a fraction of the order of the reciprocal 
of the number of atoms in a diameter of the crystal. The total effect 
of this will be the appearance of extra terms, proportional to surface 
area, in the thermodynamic functions. These we ignore for the 
present. 

Since there are very many modes of lattice vibration, we summarize
them by a distribution $g(\nu)$ which is defined by the statement that
the number of modes whose frequencies lie in the interval from $\nu$ to
$\nu + d\nu$ is $g(\nu)\,d\nu$. We must then have

$$\int_0^\infty g(\nu)\,d\nu = 3N$$

and for all but the smallest crystals $g(\nu)$ depends on N only by a
multiplying factor, so that $g'(\nu)/g(\nu)$ is independent of N.

The density function $g(\nu)$ can be determined by arbitrary fiat,
plausible argument, or appeal to experiment, the choice depending 
on the intended use of the results. The simplest assumption, due to 
Einstein, is that we are to consider each atom oscillating as if its 
neighbours are at rest in their equilibrium positions. Provided all
atomic positions are equivalent, this gives at most three distinct 
frequencies, and every atom has a motion compounded of these 
frequencies, so that in particular 

$$\frac{U}{N} = \frac{h}{2}(\nu_1+\nu_2+\nu_3) + \sum_{i=1}^{3}\frac{h\nu_i}{e^{h\nu_i/kT}-1}$$

At high temperatures $U \approx 3NkT$, which corresponds to Dulong and
Petit's law. Moreover, the specific heat falls off for $kT < h\nu_{\min}$,
which corresponds to the drop in the measured specific heat of
crystals at low temperature. Unfortunately the specific heat of this
model goes to zero, at very low temperatures, as

$$T^{-2}\,e^{-h\nu_{\min}/kT}$$

which is very much faster than the observed rate. 
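
The two limits of the Einstein model can be illustrated with a short Python sketch. It is an illustration added here, not the author's own calculation, and it assumes a single oscillator frequency ν with the dimensionless variable x = hν/kT; the function name is hypothetical. Differentiating the mean energy hν/(e^x − 1) with respect to T gives a heat capacity of k x² e^x/(e^x − 1)² per oscillator.

```python
import math


def einstein_heat_capacity(x: float) -> float:
    """Heat capacity of one oscillator in units of k, with x = h*nu/(k*T)."""
    em1 = math.expm1(x)  # e^x - 1, accurate for small x
    return x * x * (em1 + 1.0) / (em1 * em1)


# High temperature (small x): approaches 1, i.e. k per oscillator,
# which is 3k per atom as in Dulong and Petit's law.
print(einstein_heat_capacity(0.01))

# Low temperature (large x): approaches x^2 e^{-x}, i.e. T^-2 exp(-h nu / kT).
x = 20.0
print(einstein_heat_capacity(x), x * x * math.exp(-x))
```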

A more plausible model is due to Debye, who pointed out that, 
whatever the complete lattice spectrum may look like, the low fre- 
quency modes must correspond to the normal modes of oscillation of 
a block of elastic continuum of the same size, and with the same elas- 
tic constants as the crystal. Since, at low enough temperature, all the 
high frequency oscillations are frozen out, in their ground state, and 
are not contributing to the specific heat, this model should give 
the low temperature specific heat correctly. It is obviously correct at 
high temperatures, as is the Einstein model. 

Consider a cube of edge a composed of an isotropic elastic sub- 
stance. The vibrations of such a material can be resolved into longi- 
tudinal waves, in which the elastic displacement is normal to the 
wave front, and transverse waves, in which the displacement lies 
within the wave front. For a given wavelength and direction of 
propagation, there may occur one longitudinal wave, and two inde- 
pendent transverse waves in which the displacement directions are 
mutually perpendicular. 

To define the problem, suppose that the faces of the cube are 
pressed against planes of some extremely hard substance, so that 
displacements there are zero. Then the modes of oscillation of the 
cube will have exactly the same pattern as the de Broglie waves we 
discussed before for a particle moving in a cubical box, and
corresponding to the mode numbers $n_x$, $n_y$, $n_z$ we have a wavelength $\lambda$
such that

$$k^2 = \frac{1}{\lambda^2} = \frac{n_x^2+n_y^2+n_z^2}{(2a)^2}$$

The volume of $k$-space 'occupied' by each mode is $(2a)^{-3}$ or $1/(8V)$.
The number of modes occurring on the average between $k$ and
$k+dk$ is therefore

$$4\pi V k^2\,dk$$

Corresponding to each mode are three distinct wave motions,
propagating with two speeds, $c_l$ for the longitudinal wave and $c_t$ for the
transverse waves. For large wavelength and low frequency these
speeds are constant and equal to the speeds of sound. At high
frequencies, $c$ will depend on $k$ in some way not very easy to predict.

The thermal behaviour of the oscillations depends on their
frequency, which is related to the wavenumber by

$$\nu = kc$$

so that the number of oscillations between frequencies $\nu$ and $\nu+d\nu$ is

$$4\pi V\,(c_l^{-3}+2c_t^{-3})\,\nu^2\,d\nu = 12\pi V c^{-3}\,\nu^2\,d\nu$$

where $c$ is the average speed defined by

$$3c^{-3} = c_l^{-3}+2c_t^{-3}$$

This value for the number of distinct oscillations will be correct at
fairly low frequencies. The limit of 3N on the total number of
oscillations implies that there will be a maximum $k$, $k_m$, such that

$$4\pi V\int_0^{k_m} k^2\,dk = N$$

i.e.

$$\frac{4\pi}{3}k_m^3 = \frac{N}{V}$$

the number density of atoms in the material. Thus $k_m$ is of the order
of the reciprocal of the nearest-neighbour distance between atoms.
This $k_m$ will be the same for longitudinal and transverse waves, so
maximum frequencies will not be the same, and there will be a band
of frequencies in which only one type of oscillation occurs.

However, deeper examination shows that dispersion, i.e. the vary- 
ing speed of the waves, causes worse complications than this at high 
frequency. Thus we do the model no real injustice by the apparently 
inconsistent procedure of assuming a density of oscillations in
frequency simply proportional to $\nu^2$, up to a $\nu_m$ defined by

$$12\pi V c^{-3}\int_0^{\nu_m}\nu^2\,d\nu = 3N$$

i.e.

$$\nu_m = ck_m = c\left(\frac{3N}{4\pi V}\right)^{1/3}$$

This maximum frequency $\nu_m$ is characteristic of the material, not
of the sample, and is used to define the Debye temperature $\theta_D$ by
the relation

$$h\nu_m = k\theta_D$$

The Debye temperature can be predicted from the speed of sound in 
the material and its number density, but is often treated as an 
empirical parameter to be fitted to the low temperature specific 
heat data. (The true $g(\nu)$ can be found from X-ray and neutron
scattering experiments.) 
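
The prediction of the Debye temperature from the speeds of sound and the number density can be sketched in a few lines of Python. This is an illustration only: the function names are hypothetical, the wavenumber is taken as the reciprocal wavelength (so that frequency = speed × wavenumber), and the input values are round numbers chosen for the example, not data for any real material.

```python
import math

H_PLANCK = 6.62607015e-34    # Planck constant, J s
K_BOLTZMANN = 1.380649e-23   # Boltzmann constant, J/K


def mean_sound_speed(c_long: float, c_trans: float) -> float:
    """Average speed c defined by 3 c^-3 = c_l^-3 + 2 c_t^-3."""
    return (3.0 / (c_long ** -3 + 2.0 * c_trans ** -3)) ** (1.0 / 3.0)


def debye_frequency(c_long: float, c_trans: float, number_density: float) -> float:
    """Cut-off frequency nu_m = c * (3N / (4 pi V))^(1/3)."""
    c = mean_sound_speed(c_long, c_trans)
    return c * (3.0 * number_density / (4.0 * math.pi)) ** (1.0 / 3.0)


def debye_temperature(c_long: float, c_trans: float, number_density: float) -> float:
    """Debye temperature from h * nu_m = k * theta_D."""
    return H_PLANCK * debye_frequency(c_long, c_trans, number_density) / K_BOLTZMANN


# Illustrative (hypothetical) inputs: speeds in m/s, number density in m^-3.
print(debye_temperature(5000.0, 2500.0, 8.0e28))
```

Because the transverse waves are counted twice and are usually the slower, the average speed lies much closer to the transverse speed than to the longitudinal one.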

The number of oscillations between frequencies $\nu$ and $\nu+d\nu$ can
now be written as

$$\frac{9N}{\nu_m^3}\,\nu^2\,d\nu$$

and the internal energy per atom is, omitting the zero-point energy,

$$\frac{U}{N} = \frac{9}{\nu_m^3}\int_0^{\nu_m}\frac{h\nu}{e^{h\nu/kT}-1}\,\nu^2\,d\nu = \frac{9k}{\theta_D^3}\int_0^{\theta_D}\frac{\theta^3\,d\theta}{e^{\theta/T}-1}$$

The integral is a tabulated function of $T/\theta_D$. For very low T,

$$\frac{U}{N} \approx \frac{9k}{\theta_D^3}\int_0^{\infty}\theta^3 e^{-\theta/T}\,d\theta$$

where the very rapid decrease of the exponential allows us to replace
$\theta_D$ by infinity at the upper limit of integration.
Thus

$$\frac{U}{N} \approx \frac{54kT^4}{\theta_D^3}$$

and the specific heat per atom is

$$C_V \approx \frac{216kT^3}{\theta_D^3}$$

That $C_V \propto T^3$ for small T is in good agreement with experiment,
and to this extent the Debye model is entirely successful. It is less
successful at intermediate temperatures where the less regularly
distributed high frequency oscillations are coming into play. 
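
The Debye energy integral is easy to evaluate numerically, and doing so shows both limits at once. The Python sketch below is an added illustration with hypothetical function names; it uses Simpson's rule on the substitution t = θ/T, so that U/(NkT) = 9 (T/θ_D)³ ∫ t³/(e^t − 1) dt taken from 0 to θ_D/T. Keeping only the leading exponential in the integrand gives the integral the value 6 and hence the coefficient 54 quoted above; keeping the full integrand gives π⁴/15, so the coefficient becomes 3π⁴/5 ≈ 58.4. The T⁴ dependence of the energy, and so the T³ law for the specific heat, is the same in either case.

```python
import math


def _integrand(t: float) -> float:
    """t^3 / (e^t - 1), with its limiting value 0 at t = 0."""
    return 0.0 if t == 0.0 else t ** 3 / math.expm1(t)


def debye_integral(upper: float, steps: int = 2000) -> float:
    """Simpson's rule for the integral of t^3/(e^t - 1) from 0 to `upper`."""
    if steps % 2:
        steps += 1  # Simpson's rule needs an even number of intervals
    h = upper / steps
    total = _integrand(0.0) + _integrand(upper)
    for i in range(1, steps):
        total += (4.0 if i % 2 else 2.0) * _integrand(i * h)
    return total * h / 3.0


def energy_per_atom_over_kT(t_over_theta: float) -> float:
    """U/(N k T) for the Debye model at the reduced temperature T/theta_D."""
    return 9.0 * t_over_theta ** 3 * debye_integral(1.0 / t_over_theta)


# High temperature: tends to 3 (Dulong and Petit).
print(energy_per_atom_over_kT(50.0))
# Low temperature: U/(N k T) divided by (T/theta_D)^3 tends to 3 pi^4 / 5.
print(energy_per_atom_over_kT(0.02) / 0.02 ** 3, 3.0 * math.pi ** 4 / 5.0)
```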

Black body radiation 

Closely related to the problem of elastic waves treated in the pre- 
vious section is the case of electromagnetic radiation in thermal 
equilibrium. We consider a cavity of volume V whose walls are at 
temperature T. If the walls are almost perfect reflectors, the tangen- 
tial electric field must vanish at the walls, and the permitted wave 
patterns are the same as for the particle in a box. The only change is 
that the patterns with one of $n_x$, $n_y$ or $n_z = 0$ are now possible, which
was not the case before; these however contribute only to the
surface-dependent part of the energy, and not to the bulk energy. For
the wave patterns with $n_x$, $n_y$ and $n_z$ non-zero there are two
independent states of polarization possible, so the number of independent
oscillations with wave numbers in the range $k$ to $k+dk$ is

$$8\pi V k^2\,dk$$

Of course, if the walls are quite perfect reflectors there is no 
mechanism for the emission or absorption of radiation, but we are 
concerned only with the thermal equilibrium state, and the least 
imperfection in the reflection of any part of the wall will enable this 
equilibrium to be reached eventually, without perceptibly disturbing 
the spectrum of the cavity oscillations. 

The speed of light in vacuum, $c$, is the same for all $k$, so the
frequency of an oscillation of wavenumber $k$ is $ck$ and the quantum of
energy associated with it is $hck = h\nu$. The number of oscillations in
the range of frequencies from $\nu$ to $\nu+d\nu$ is

$$\frac{8\pi V}{c^3}\,\nu^2\,d\nu$$

so the density of energy in the field is

$$\frac{U}{V} = \frac{8\pi}{c^3}\int_0^\infty\frac{h\nu}{e^{h\nu/kT}-1}\,\nu^2\,d\nu$$

where the upper limit of frequency is infinite since, so far as we know,
there is no natural upper bound to the frequency of possible photons.
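Then, with the change of variable $x = h\nu/kT$,

$$\frac{U}{V} = 8\pi hc\left(\frac{kT}{hc}\right)^4\int_0^\infty\frac{x^3\,dx}{e^x-1}$$

Now

$$\int_0^\infty\frac{x^3\,dx}{e^x-1} = \int_0^\infty\frac{x^3e^{-x}\,dx}{1-e^{-x}}$$

and since $\exp(-x) < 1$ for $x > 0$ we can write

$$\begin{aligned}
\int_0^\infty\frac{x^3e^{-x}}{1-e^{-x}}\,dx &= \int_0^\infty x^3e^{-x}\,(1+e^{-x}+e^{-2x}+\cdots)\,dx \\
&= \sum_{n=1}^\infty\int_0^\infty x^3e^{-nx}\,dx \\
&= \sum_{n=1}^\infty n^{-4}\int_0^\infty x^3e^{-x}\,dx \\
&= 6\sum_{n=1}^\infty n^{-4}
\end{aligned}$$

The series $\sum n^{-4}$ converges rapidly. This sum is called $\zeta(4)$, where
$\zeta(s) = \sum n^{-s}$ for real or complex $s$ is known as Riemann's $\zeta$-function.
The evaluation of the $\zeta$-function is not elementary, so we just take the
value $\pi^4/90$ from tables, and end with the energy density

$$\frac{U}{V} = \frac{8\pi hc}{15}\left(\frac{\pi kT}{hc}\right)^4$$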
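
The rapid convergence of the series for ζ(4), and the tabulated value π⁴/90, can be confirmed with a few lines of Python. This is an added check rather than part of the original derivation, and the function name is an illustrative choice.

```python
import math


def zeta4_partial(terms: int) -> float:
    """Partial sum of n^-4 for n = 1 .. terms."""
    return sum(n ** -4 for n in range(1, terms + 1))


exact = math.pi ** 4 / 90.0
for terms in (1, 3, 10, 1000):
    approx = zeta4_partial(terms)
    print(f"{terms:5d} terms: {approx:.10f}   relative error {abs(approx - exact) / exact:.2e}")

# The radiation integral itself is 6 * zeta(4) = pi^4 / 15.
print(6.0 * zeta4_partial(1000), math.pi ** 4 / 15.0)
```

The first term alone already accounts for more than nine tenths of the sum, which is why ten terms suffice for most purposes.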



We see, first, that $U/V$ is proportional to $T^4$, in agreement with
Stefan's law, and secondly, that if we define a certain wavelength $\lambda_0$
of the thermal radiation by $\lambda_0 = hc/(\pi kT)$, we have

$$\frac{U}{V} = \frac{8\pi^2}{15}\,\frac{kT}{\lambda_0^3} \approx 6\,\frac{kT}{\lambda_0^3}$$

which can be expressed verbally by saying that energy $kT$ is associated
with each direction of polarization (2) and propagation (3)
in each volume $\lambda_0^3$, where the characteristic distance $\lambda_0$ is itself
inversely proportional to $kT$. The differential energy density associated
with the frequency interval $d\nu$ at $\nu$, which is

$$\frac{dU}{V} = \frac{8\pi h}{c^3}\,\frac{\nu^3\,d\nu}{e^{h\nu/kT}-1}$$

can be expressed in terms of the corresponding wavelength $\lambda = c/\nu$ as

$$\frac{dU}{V} = \frac{8\pi hc}{\lambda^5}\,\frac{d\lambda}{e^{\pi\lambda_0/\lambda}-1}$$

If $\lambda \ll \lambda_0$, this expression is dominated by the exponential term and
becomes

$$\frac{dU}{V} = \frac{8\pi hc}{\lambda^5}\,e^{-\pi\lambda_0/\lambda}\,d\lambda$$

while if $\lambda \gg \lambda_0$, in the far infra-red, we can expand the exponential and
obtain

$$\frac{dU}{V} = \frac{8\pi kT}{\lambda^4}\,d\lambda$$
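
The short-wavelength and long-wavelength limits of the spectrum can be compared with the full expression numerically. The Python sketch below is an added illustration. It assumes the thermal wavelength is defined by hc/(πkT), works with the reduced wavelength u (the wavelength divided by that thermal wavelength), and measures the spectral energy density in units of 8πhc divided by the fourth power of the thermal wavelength; the function names are hypothetical.

```python
import math


def planck(u: float) -> float:
    """Full spectral density per unit reduced wavelength u."""
    return 1.0 / (u ** 5 * math.expm1(math.pi / u))


def short_wavelength_limit(u: float) -> float:
    """Form dominated by the exponential term, valid for u << 1."""
    return math.exp(-math.pi / u) / u ** 5


def long_wavelength_limit(u: float) -> float:
    """Form obtained by expanding the exponential, valid for u >> 1."""
    return 1.0 / (math.pi * u ** 4)


for u in (0.1, 1.0, 10.0, 100.0):
    print(u, planck(u), short_wavelength_limit(u), long_wavelength_limit(u))
```

In these units the long-wavelength form is 1/(πu⁴), which is the reduced version of 8πkT dλ/λ⁴, and it contains no trace of Planck's constant once the units are restored.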



SYSTEMS OF IDENTICAL PARTICLES 

Physical identity 

The simple systems of the previous chapter had the common feature 
that they contained only a single motion or an easily identified set of 
completely independent motions, so that the distinct quantum states 
of each system were enumerated in a perfectly straightforward way. 
The reader's attention has already been drawn, in the section on the 
rotor, to the next problem that must be faced: systems composed of 
identical particles have certain restrictions on the states of motion 
which are accessible to them. 

These restrictions are most easily understood in terms of the wave 
function which describes a pure state of such a system. This is a
complex function $\psi(r, \ldots, r', \ldots)$ of the positions and spins of all the
particles of the system, such that $\psi^*\psi\,dr \ldots dr' \ldots$ measures the
probability that the system in the given pure state should be found
in the range of configurations $r \to r+dr, \ldots, r' \to r'+dr', \ldots$; $\psi^*$ is
complex conjugate to $\psi$. Here $r$ must be taken to include both position
and spin co-ordinates. Now it appears that identical particles are in
fact so strongly identical that the physical situation which arises
when two of them are interchanged is exactly the same as the
original. This can only be so if the corresponding wave function

$$\psi(r', \ldots, r, \ldots)$$

differs from the original

$$\psi(r, \ldots, r', \ldots)$$

only through multiplication by some (possibly complex) number of
modulus 1. Since a repetition of the same exchange gives us back a
situation which is formally as well as physically identical with the
original, the square of this number is 1 and its only possible values 
are +1 and —1. 

Moreover, if we have identical particles A, B, C in the same system, 
the exchange of A and B may be achieved directly or by the sequence, 
B with C, A with C, and B with C again. Thus the effect of the ex- 
change of A with B on the wave function must involve the same 
multiplying factor as the exchange of A with C, and the same again 
(using AC, BC, AC) as the exchange of B with C. Thus in a given 
system the exchange of any pair of identical particles of a given 
species multiplies the wave function by the same number + 1 or - 1. 
The sign might still vary from system to system as well as from spe- 
cies to species. 

It turns out that all the elementary particles fall into two classes 
known respectively as bosons and fermions. Bosons have a spin
angular momentum of zero or an integral multiple of $\hbar$; fermions
have a spin which is half of an odd integral multiple of $\hbar$ (all the
fundamental fermions known at present have spin $\hbar/2$). Electrons,
nucleons and compound objects containing odd numbers of these
particles (e.g. He$^3$ atoms) are fermions; photons, $\pi$-mesons and
compound objects containing even numbers of fermions (e.g. He$^4$
atoms) are bosons.

The wave function of a system is unaltered by the exchange of two 
bosons of the same sort; it changes sign on the exchange of two 
fermions of the same sort. Since the wave function, which is con- 
tinuous, must then vanish whenever two like fermions have the same 
co-ordinates, it follows that two like fermions can never coincide in 
both position and spin. 

Another expression of the rule for identical fermions is the familiar 
exclusion principle of Pauli: if the state of the whole system is 
constructed from a set of independent single-particle states, no two 
identical fermions can occupy the same state. 

Ortho- and parahydrogen 

In the H 2 molecule the two protons are fermions. The symmetry of 
the wave function with respect to exchange of the positions of the 
protons is determined by the rotational quantum number, $j$, the
effect of the exchange being to multiply the wave function by $(-1)^j$.
Since the effect of exchanging both position and spin must be to
multiply the wave function by $-1$, the effect of exchanging spins
alone is to multiply by $-1$ for even $j$ and by $+1$ for odd $j$. The
one antisymmetric spin wave function corresponds to antiparallel
spins, giving one state of zero total spin angular momentum: the
three symmetric spin wave functions correspond to parallel spins, 
giving the three distinct states of orientation of a unit total spin 
angular momentum. 

Since there is no dynamical interaction compelling the exchange of 
angular momentum between the proton spins and the rotation of the 
whole molecule, the stationary states of the molecule fall into two 
families: orthohydrogen with odd $j$ and parallel proton spins, and
parahydrogen with even $j$ and antiparallel proton spins. The ground
state, with $j = 0$, belongs to parahydrogen and lies distinctly below
the first ortho-state at $j = 1$, so that at very low temperature hydrogen
molecules should be practically all in the parahydrogen ground state. 

In the absence of a strong magnetic field, which would orient the 
magnetic moments of the protons, the different spin states make no 
contribution to the energy, so the partition function for the rotation 
of the molecule can in general be written 

$$Z = Z_p + 3Z_o$$

where $Z_p$ and $Z_o$ are sums over the states with even and odd $j$
respectively. If $T \gg \theta$, the rotational characteristic temperature previously
defined, then

$$Z_p \approx Z_o \approx T/(2\theta)$$

and the ratio of parahydrogen to orthohydrogen in the equilibrium 
mixture is just 1 to 3. 
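
The approach of the equilibrium mixture to the 1 to 3 ratio can be followed numerically. The Python sketch below is an added illustration, not the author's calculation. It assumes rigid-rotor levels of energy j(j+1)kθ with rotational degeneracy 2j+1, where θ is the rotational characteristic temperature, and a spin weight of 1 for even j and 3 for odd j; the function names and the cut-off `j_max` are hypothetical choices.

```python
import math


def rotational_sum(t_over_theta: float, parity: int, j_max: int = 200) -> float:
    """Sum of (2j+1) exp(-j(j+1) theta/T) over even (parity 0) or odd (parity 1) j."""
    return sum(
        (2 * j + 1) * math.exp(-j * (j + 1) / t_over_theta)
        for j in range(parity, j_max + 1, 2)
    )


def ortho_fraction(t_over_theta: float) -> float:
    """Equilibrium fraction of orthohydrogen at the reduced temperature T/theta."""
    z_para = rotational_sum(t_over_theta, 0)
    z_ortho = rotational_sum(t_over_theta, 1)
    return 3.0 * z_ortho / (z_para + 3.0 * z_ortho)


for t in (0.1, 0.5, 1.0, 2.0, 5.0, 50.0):
    print(f"T/theta = {t:5.1f}   ortho fraction = {ortho_fraction(t):.6f}")
```

At the lowest reduced temperatures the ortho fraction is negligible, in line with the remark that cold hydrogen in equilibrium is practically all parahydrogen.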

Two interesting points now arise. The first is that hydrogen mole- 
cules, even in the liquid, apparently rotate with very little mutual 
interference, so that the distinction of ortho- and parahydrogen is 
still valid even in the cold liquid. The second is that the very weak 
coupling between the proton spins and the molecular rotation makes 
the transformation of ortho- to parahydrogen exceedingly slow. In 
fact, the transformation has to be catalysed by the very high varying 
magnetic field of a paramagnetic atom or molecule (e.g. dissolved O$_2$
or a metal surface) to proceed at a useful rate. Hydrogen liquefied 
without catalysed conversion will contain three parts of ortho- 
hydrogen to one of parahydrogen, and the approach to the equili- 
brium ratio is a thoroughly irreversible process, taking place at 
paramagnetic centres in the container walls. The heat liberated in 
this process is enough to cause an alarming evaporation loss on 
storage. 

Note that in the classical limit, $T \gg \theta$, $Z \approx 2T/\theta$, where one would
expect for distinguishable particles of spin $\tfrac12$ to have $2 \times 2 \times T/\theta$, a
factor of 2 for each particle. In this limit, then, the identity of two
particles just halves the 'classically' expected partition function. 

The ideal gas—Gibbs' paradox 

Our molecular model of an ideal gas will consist of N similar
particles moving without interaction in a volume V. If their movements
were totally independent and the particles were all distinct, we should
then expect a partition function

$$Z_N = Z_1^N$$

where

$$Z_1 = \left(\frac{2\pi mkT}{h^2}\right)^{3/2} V$$

as we found before for a single particle. It is easy to show that this
leads to results in contradiction with experience.
The entropy of a single molecule moving in V has been found to be

$$S_1 = k\ln V + \tfrac32 k\ln T + k\ln\left(\frac{2\pi mk}{h^2}\right)^{3/2} + \tfrac32 k$$

If $Z_N = Z_1^N$, we should then have

$$S_N(V) = R\ln V + \tfrac32 R\ln T + R\ln\left(\frac{2\pi mk}{h^2}\right)^{3/2} + \tfrac32 R$$

where $R = Nk$. Now suppose we have two volumes, each V, each
containing N molecules and separated from each other by a valve.
The total entropy is $2S_N(V)$. If the valve is now opened, the whole
volume 2V is accessible to every molecule, and the entropy should
be

$$S_{2N}(2V) = 2R\ln 2V + 3R\ln T + 2R\ln\left(\frac{2\pi mk}{h^2}\right)^{3/2} + 3R = 2S_N(V) + 2R\ln 2$$

The entropy of mixing, $2R\ln 2$, does indeed appear when two
different gases diffuse into each other, but does not appear when we 
have the identical gas in the two volumes. The difficulty is removed 
when we recognize the indistinguishability of physically identical 
molecules. 
We consider first a system consisting of only two identical bodies. 

Two non-interacting identical particles 

The protons in the hydrogen molecule are almost rigidly connected
together. We now consider two particles which have no dynamical
interaction, their movement being controlled by an external field of
force which is the same for both. The change in behaviour of each
particle in the presence of the other is thus determined entirely by
the indistinguishability of the particles and their character as bosons
or fermions.

Let the distinct energy states of one particle in the external field
have energy $a_i$. If $i \ne j$, there is one state of the system with energy
$\epsilon = a_i + a_j$. If $i = j$, there is one state of the system with $\epsilon = 2a_i$ provided
the particles are bosons: there is no such state if the particles are
fermions. Thus the partition function for this system is

$$Z = \tfrac12\sum_{i \ne j} e^{-(a_i+a_j)/kT} + \left[\sum_i e^{-2a_i/kT}\ \text{for bosons}\right]$$

where the factor $\tfrac12$ in the first term is needed because each
single-particle state occurs once in the sum over $i$ and again in the sum over
$j$, so that every pair $i, j$ is named twice in the sum. This can be
rewritten as

$$Z = \tfrac12\sum_{i,j} e^{-(a_i+a_j)/kT} \pm \tfrac12\sum_i e^{-2a_i/kT}$$

where the first sum is now without restriction on $i$ and $j$; the upper
sign is valid for bosons and the lower for fermions. This is the same
as

$$Z = \tfrac12 Z_1^2 \pm \tfrac12\sum_i e^{-2a_i/kT}$$

We see that this differs from the expression $Z = Z_1^2$, which one
expects for distinct particles, in two ways. First, the factor $\tfrac12$ accounts
for the identity of the particles: second, the situations where both 
particles are in the same state are emphasized for bosons and ex- 
cluded for fermions. 
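
The two-particle result can be checked by listing the states explicitly. The Python sketch below is an added illustration: it enumerates the distinct two-particle states for a small, arbitrarily chosen set of one-particle energies (the level values and function names are hypothetical) and compares the sum with half the square of the one-particle partition function, plus or minus half the sum of the doubled-energy Boltzmann factors.

```python
import math
from itertools import combinations, combinations_with_replacement


def z_one(levels, kT):
    """One-particle partition function."""
    return sum(math.exp(-a / kT) for a in levels)


def z_two_enumerated(levels, kT, bosons):
    """Sum over the distinct two-particle states, listed explicitly.

    Bosons may share a one-particle state; fermions may not.
    """
    pairs = combinations_with_replacement(levels, 2) if bosons else combinations(levels, 2)
    return sum(math.exp(-(a + b) / kT) for a, b in pairs)


def z_two_formula(levels, kT, bosons):
    """Z = Z_1^2 / 2 +/- (1/2) * sum_i exp(-2 a_i / kT)."""
    sign = 1.0 if bosons else -1.0
    doubled = sum(math.exp(-2.0 * a / kT) for a in levels)
    return 0.5 * z_one(levels, kT) ** 2 + sign * 0.5 * doubled


levels = [0.0, 1.0, 1.0, 2.5, 4.0]  # hypothetical energies, one entry per state
for bosons in (True, False):
    print(bosons, z_two_enumerated(levels, 1.3, bosons), z_two_formula(levels, 1.3, bosons))
```

Degenerate levels are handled by listing each state separately, so two entries with the same energy still count as different one-particle states.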

The hot ideal gas — Gibbs' paradox resolved 

If we have N identical particles and T is so high that $a_N \ll kT$, the particles
will wander over many times N one-particle states, so that the main part
of the partition function is obtained by considering only those states
of the system in which each particle is in a different one-particle state.
Since there is only one state with one particle in $a_i$, one in $a_j$ and so
on, while the expression $Z_1^N$ names each such state $N!$ times (in
every possible order of the indices $i, j, \ldots$), this main part of the
partition function is entirely included in

$$\frac{1}{N!}Z_1^N$$

This expression includes terms in which several particles are in the
same one-particle state. These must be supplemented for bosons or
excluded for fermions, so we have

$$Z = \frac{1}{N!}Z_1^N + \text{corrections for particle character}$$

At sufficiently high T the correction terms can be neglected, and
we have

$$Z \approx \frac{1}{N!}\left[\left(\frac{2\pi mkT}{h^2}\right)^{3/2} V\right]^N$$

Using only the leading terms $\ln N! \approx N\ln N - N$ in Stirling's
approximation, we have now for the entropy

$$\begin{aligned}
S_N(V) &= R\ln V + \tfrac32 R\ln T + R\ln\left(\frac{2\pi mk}{h^2}\right)^{3/2} + \tfrac32 R - R\ln N + R \\
&= R\ln\frac{V}{N} + \tfrac32 R\ln T + R\ln\left(\frac{2\pi mk}{h^2}\right)^{3/2} + \tfrac52 R
\end{aligned}$$

where $R = Nk$, a very important result in agreement with the familiar
thermodynamic behaviour of the entropy.
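
The way the factor 1/N! removes the spurious entropy of mixing can be seen in a small numerical comparison. The Python sketch below is an added illustration with hypothetical function names. It works with S/k in arbitrary units, and the additive constant `const` stands for the logarithm of the mass-dependent factor, which cancels in the comparison.

```python
import math


def entropy_distinct(n, volume, temperature, const=0.0):
    """S/k from Z_N = Z_1^N, i.e. particles treated as all distinct."""
    return n * (math.log(volume) + 1.5 * math.log(temperature) + const + 1.5)


def entropy_identical(n, volume, temperature, const=0.0):
    """S/k from Z_N = Z_1^N / N!, using ln N! ~ N ln N - N."""
    return n * (math.log(volume / n) + 1.5 * math.log(temperature) + const + 2.5)


def mixing_excess(entropy, n, volume, temperature):
    """Entropy of 2N particles in 2V minus twice that of N particles in V."""
    return entropy(2 * n, 2 * volume, temperature) - 2 * entropy(n, volume, temperature)


n, volume, temperature = 1.0e3, 2.0, 300.0
print(mixing_excess(entropy_distinct, n, volume, temperature), 2 * n * math.log(2.0))
print(mixing_excess(entropy_identical, n, volume, temperature))
```

With the corrected formula the entropy depends on volume only through V/N, so it is extensive and opening the valve between two samples of the same gas changes nothing.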

We must now find out at what temperatures the correction terms 
begin to matter, and calculate their form. This part of the work is 
enormously simplified if we discuss first of all a more general 
statistical question than was previously considered. 

The grand partition function 

Devices which measure quantities of gas very seldom determine mass, 
or number of molecules, directly. Usually one measures pressure and 
volume at known temperature, and deduces the mass using an equa- 
tion of state. Even when a direct weighing is made, it is never so 
accurate that the number of molecules present is known to plus or 
minus one (or one million, if it comes to that). Thus no physical 
verisimilitude is really lost if we treat the number N of particles in
our physical system as a statistical expectation value just as we did
with U. We can of course choose to discuss systems in which the
number N of particles present does obviously vary with time; for
example, a small fixed volume in a region occupied by matter, where 
particles are continually wandering in and out. 

We consider then a system which contains a variable number N of
particles of the same sort. Let the distinct energy states of the system
when it contains N particles be $\epsilon_i(N)$, and suppose that they have
associated with them probabilities $p_i(N)$ such that

$$\sum_i\sum_N p_i(N) = 1$$

The information carried by this set of probabilities is

$$H = \sum_i\sum_N p_i(N)\ln[p_i(N)]$$

and we now enquire what choice of the $p_i(N)$ minimizes this $H$
subject to the further conditions

$$\sum_i\sum_N N\,p_i(N) = \bar N$$

$$\sum_i\sum_N \epsilon_i(N)\,p_i(N) = U$$

where the expectation values $\bar N$ and $U$ are given.

All we have to do is to introduce another Lagrange multiplier, and
we find for the least informative set of $p_i$ consistent with the given
conditions

$$p_i(N) = \Xi^{-1}e^{\nu N-\beta\epsilon_i(N)}$$

where

$$\Xi = \sum_{i,N} e^{\nu N-\beta\epsilon_i(N)}$$

is called the grand partition function. We note that

$$\Xi = \sum_N e^{\nu N}Z(N)$$

where $Z(N)$ is our previous partition function for the system with
exactly N particles. If we define $\Omega$ to be $\ln\Xi$, then

$$\bar N = \frac{\partial\ln\Xi}{\partial\nu} = \frac{\partial\Omega}{\partial\nu}$$

and

$$U = -\frac{\partial\Omega}{\partial\beta}$$

To re-establish connection with thermodynamics, we have to recall
a little more of the thermodynamics than we have used hitherto. A
useful thermodynamic function closely related to the Helmholtz
free energy $F\ (= U-TS)$ is the Gibbs free energy

$$G = U+PV-TS$$

where P, V represent pressure and volume, and U, T, S are internal
energy, temperature and entropy as before. The chemical potential
$\mu_c$ of any component is defined by $\mu_c = \partial G/\partial N_c$, where $N_c$ is the number
of molecules of that component present in the system. If the system
is made to grow at constant temperature and pressure by feeding in
new material in such proportions that the concentrations of the
various components remain constant, the $\mu_c$ remain constant as total
mass and volume increase, so we see that $G$ can be expressed as a
sum over the components

$$G = \sum_c N_c\mu_c$$

and for a one component system

$$G = N\mu$$

If two systems 1, 2 are brought into contact so that they can
exchange both material and energy, the conditions of thermodynamic
equilibrium between them are that, for all components,

$$\mu_{c,1} = \mu_{c,2}$$

and that

$$T_1 = T_2$$

If, in addition, there is no mechanical barrier between them (such as 
a stiff, permeable membrane) there is a condition of mechanical 
equilibrium $P_1 = P_2$, but such barriers occur in many interesting
situations, for example, as biological cell membranes. 

Reverting to the statistical picture, consider two systems
characterized by $\bar N_1$, $U_1$ and $\bar N_2$, $U_2$, and the corresponding $\nu_1$, $\beta_1$ and $\nu_2$, $\beta_2$.
If we make a conceptual union of the two systems, the situation

$$[N_1, \epsilon_i(N_1);\ N_2, \epsilon_j(N_2)]$$

will occur with probability

$$p[N_1, \epsilon_i(N_1);\ N_2, \epsilon_j(N_2)] \propto \exp[\nu_1N_1+\nu_2N_2-\beta_1\epsilon_i(N_1)-\beta_2\epsilon_j(N_2)]$$

This probability is unaltered, on changing the conceptual union to a
physical one in which matter and energy are freely exchanged
between the two systems, only if it is equivalent to

$$\exp\{\nu(N_1+N_2)-\beta[\epsilon_i(N_1)+\epsilon_j(N_2)]\}$$

for all $N$, $\epsilon$. This can only be true if $\nu_1 = \nu_2 = \nu$, $\beta_1 = \beta_2 = \beta$. Thus the
statistical conditions $\nu_1 = \nu_2$, $\beta_1 = \beta_2$ are together equivalent to the
thermodynamic conditions $\mu_1 = \mu_2$, $T_1 = T_2$.
To identify $\nu$ and $\beta$ more closely we consider how $\Omega$ $(= \ln\Xi)$
depends on $\nu$, $\beta$ and the $\epsilon$. The change in $\Omega$ produced by small
changes in these variables is

,_ 8n aa .., v 80. . ,__ 

dQ = ^ dv+ -aJ dp+ Z^(N) d€m 

= N dv- U dy3— ^ 2 pffl) ck(tf) 

UN 

The form of this suggests that we look at the variation of Q. + up — Nv, 
which is then, from the above equation 

d(Cl + Up-Nv) = -vdN+fidQ 

where dg^is 2i,w e ( (N) dp t (N) as before. In particular, for a fixed 
value of N we have _ 

d(Cl + UP-Nv) = pdQ 

so j3 is an integrating factor for the heat increment dQ. Since the 
function Cl+Up—Nv is additive when independent systems are 
united (conceptually, or, if they are in thermodynamic equilibrium, 
by physical contact), p is 1 \kT as before. We then have 

d(£l + U/(kT)-Nv) = dS/k 

and since we already have 

PV+U-Ny> = TS 
we have 

1 



or 



1 IP \ Q 
kf(- p -' X ) = N- V 

Now,jt/AT and v differ only by a function of temperature. Since PV 
and N are independently variable at given T, for all systems other 
than a Boyle gas, the terms in the last equation are set independently 
equal, so that 

v = kf 
and 

PV 

" AT 

We have, finally, an alternative statistical expression of thermodynamics
in terms of the grand partition function $\Xi$, for which

$$e^{PV/kT} = \Xi = \sum_{N,i} e^{(1/kT)[\mu N - \epsilon_i(N)]}$$

where $\mu$ is the chemical potential, so that the mean number of
particles present is

$$\bar{N} = \left(\frac{\partial \ln \Xi}{\partial \nu}\right)_{\beta}, \qquad \nu = \frac{\mu}{kT}$$

and the internal energy

$$\bar{U} = -\left(\frac{\partial \ln \Xi}{\partial \beta}\right)_{\nu}$$
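
The two derivative relations for the mean particle number and the internal energy can be checked numerically. The sketch below is only an illustration: the list of states and the values of $\nu$ and $\beta$ are invented for the purpose and are not taken from the text. It forms $\ln \Xi = \ln \sum \exp[\nu N - \beta\epsilon]$ for a toy system and compares central finite differences of $\ln \Xi$ with the means computed directly from the probabilities.

```python
import math

# Hypothetical toy system: each state is (particle number N, energy eps).
# These values are illustrative assumptions only.
STATES = [(0, 0.0), (1, 1.0), (1, 2.0), (2, 3.5)]


def ln_xi(nu, beta, states=STATES):
    """Natural log of the grand partition function sum exp(nu*N - beta*eps)."""
    return math.log(sum(math.exp(nu * n - beta * e) for n, e in states))


def means(nu, beta, states=STATES):
    """Mean N and mean U computed directly from the state probabilities."""
    weights = [math.exp(nu * n - beta * e) for n, e in states]
    xi = sum(weights)
    n_bar = sum(w * n for w, (n, _) in zip(weights, states)) / xi
    u_bar = sum(w * e for w, (_, e) in zip(weights, states)) / xi
    return n_bar, u_bar


def derivative(f, x, h=1e-5):
    """Central finite-difference estimate of df/dx."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


nu, beta = 0.3, 0.8  # assumed values of the two multipliers
n_bar, u_bar = means(nu, beta)
n_from_xi = derivative(lambda v: ln_xi(v, beta), nu)     # d ln Xi / d nu
u_from_xi = -derivative(lambda b: ln_xi(nu, b), beta)    # -d ln Xi / d beta
print(n_bar, n_from_xi)
print(u_bar, u_from_xi)
```

The two numbers on each printed line should agree to within the accuracy of the finite-difference step.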



Non-interacting identical particles 

Since the statistical character of fermions and bosons only affects the 
behaviour of gases composed of them at temperatures and concen- 
trations where the forces of interaction between the particles are also 
important, it is unprofitable to consider gases of polyatomic mole- 
cules in this chapter. We shall consider only gases of one component, 
made up of atoms or of sub-atomic particles. 

Suppose that we have non-interacting particles at chemical
potential $\mu$ and temperature $T$, moving in a containing field in
which the energy levels of individual particles are $\alpha_i$. Since the
particles are indistinguishable, a steady configuration of the system
is completely determined by the set of numbers $n_i$ of particles in the
set of levels $\alpha_i$. This whole set is conveniently denoted by $n$. The
energy of the system in this configuration is then

$$\epsilon_n = \sum_i n_i \alpha_i$$

and the total number of particles present is

$$N_n = \sum_i n_i$$

The grand partition function of this system is thus

$$\Xi = \sum_n \exp\left[\frac{1}{kT}\left(\mu \sum_i n_i - \sum_i n_i \alpha_i\right)\right]$$

$$= \sum_n \prod_i e^{(n_i/kT)(\mu - \alpha_i)}$$

$$= \sum_n \prod_i t_i^{n_i} = \prod_i \left(\sum_{n_i} t_i^{n_i}\right)$$

where $t_i = \exp[(\mu - \alpha_i)/kT]$. The $n$ over which the sum is taken include
only those partitions of the $N$ which are permitted by the statistical
character of the particles. For bosons, all partitions are admissible,
while for fermions the $n_i$ can only be one or zero. It is then easy to
see that for bosons

$$\Xi = \prod_i (1 + t_i + t_i^2 + \cdots)$$

while for fermions

$$\Xi = \prod_i (1 + t_i)$$

Briefly,

$$\Xi = \prod_i (1 \mp t_i)^{\mp 1}$$

where each upper sign is valid for bosons, and the lower for fermions.
This convention will be used hereafter.

An important probability in this situation is the probability $p_i(n_i)$
of finding exactly $n_i$ particles in level $\alpha_i$, irrespective of what is
happening elsewhere. This is evidently $\Xi^{-1}$ times the sum of all the terms
in $\Xi$ for which the exponent of $t_i$ is $n_i$, and we see by inspecting the
structure of $\Xi$ that

$$p_i(n_i) = t_i^{n_i} (1 \mp t_i)^{\pm 1}$$

The expectation value $\langle n_i \rangle$ of $n_i$ is then at once, for fermions,

$$\langle n_i \rangle = \frac{t_i}{1 + t_i} = (t_i^{-1} + 1)^{-1}$$

Using the identity $\sum_n n t^n = t/(1 - t)^2$, we find for bosons,

$$\langle n_i \rangle = \frac{t_i}{(1 - t_i)^2}\,(1 - t_i) = (t_i^{-1} - 1)^{-1}$$

so that in general

$$\langle n_i \rangle = (t_i^{-1} \mp 1)^{-1}$$

These distributions are the well-known distribution laws due to 
Bose and Einstein and to Fermi and Dirac respectively. The deriva- 
tion presented here has the surprising merit of being both honest and 
easy. 
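
The two distribution laws are easy to try out numerically. The following sketch is an illustration only; the function names and the sample values are assumptions made here. It evaluates the mean occupation of a level from $t = \exp[(\mu - \alpha)/kT]$ and, for bosons, confirms the closed form by summing $n\,p(n)$ directly with $p(n) = t^n(1 - t)$.

```python
import math


def mean_occupancy(alpha, mu, kT, kind):
    """Mean number of particles in a level of energy alpha.

    kind is "boson" or "fermion"; t = exp((mu - alpha)/kT).
    """
    t = math.exp((mu - alpha) / kT)
    if kind == "boson":
        if t >= 1.0:
            raise ValueError("bosons need mu < alpha so that t < 1")
        return 1.0 / (1.0 / t - 1.0)
    if kind == "fermion":
        return 1.0 / (1.0 / t + 1.0)
    raise ValueError("kind must be 'boson' or 'fermion'")


def boson_occupancy_by_direct_sum(t, n_max=2000):
    """Sum of n * p(n) with p(n) = t**n * (1 - t), truncated at n_max."""
    return sum(n * t ** n * (1.0 - t) for n in range(n_max + 1))


# A fermion level lying exactly at the chemical potential is half filled.
print(mean_occupancy(0.0, 0.0, 1.0, "fermion"))
# For t = 1/2 the boson closed form and the direct sum should agree.
print(mean_occupancy(0.0, math.log(0.5), 1.0, "boson"),
      boson_occupancy_by_direct_sum(0.5))
```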

The next step is to specify some potential energy which defines the
$\alpha_i$, and calculate the consequences.

The ideal monatomic gas at chemical potential $\mu$

As before, we consider a volume $V$ within which the gas atoms move
at zero potential energy, the potential energy increasing sharply to
an effectively infinite value at the boundary. If the magnitude of
momentum of a gas atom of mass $m$ is $p$, its kinetic energy is $p^2/2m$;
the substitution $x = p/\sqrt{2mkT}$, $\zeta = e^{\mu/kT}$ then gives (see p. 30)

$$\bar{N} = 4\pi V \left(\frac{2mkT}{h^2}\right)^{3/2} \int_0^\infty \frac{x^2\, dx}{\zeta^{-1} e^{x^2} \mp 1}$$

$$\bar{U} = 4\pi V kT \left(\frac{2mkT}{h^2}\right)^{3/2} \int_0^\infty \frac{x^4\, dx}{\zeta^{-1} e^{x^2} \mp 1}$$

$$F = \bar{N}\mu - kT \ln \Xi$$

$$= \bar{N}\mu \pm 4\pi V kT \left(\frac{2mkT}{h^2}\right)^{3/2} \int_0^\infty x^2 \ln(1 \mp \zeta e^{-x^2})\, dx$$

Evidently the parameter $\zeta$ is non-negative: we see further that for
bosons, if $\zeta > 1$, the integrals have an unmanageable singularity at
$x = \sqrt{\mu/kT}$, so that for bosons $\mu$ is necessarily negative; $\mu$ may be
assigned any positive or negative value for fermions without trouble
arising.

Hot dilute ideal gas 

The expression for $\bar{N}$ given above serves to determine $\zeta$ if the
numerical density $n = \bar{N}/V$ and the temperature $T$ are given. Evidently
$\zeta$ is less for larger $T$ and lower $n$. If $\zeta$ is small enough, the
positive or negative unit in the denominator of the integrand is
negligible compared with $\zeta^{-1}$ and we may write

$$n = \zeta \left(\frac{2\pi mkT}{h^2}\right)^{3/2}$$

so $\zeta = n\lambda^3$, where $\lambda = h/\sqrt{2\pi mkT}$ is the characteristic wavelength of
the particle at $T$, i.e. $\zeta$ is the number of particles in the characteristic
volume.

Correspondingly, $\bar{U} = \tfrac{3}{2}\bar{N}kT$, reproducing our previous result for
the hot dilute gas with fixed composition $N = \bar{N}$.
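
To get a feeling for how small the parameter is in an ordinary gas, the sketch below evaluates $n\lambda^3$ with $\lambda = h/\sqrt{2\pi mkT}$ in SI units. The choice of helium-4 at 300 K and one atmosphere is an assumption made here for illustration, not an example from the text. The second function checks, by a simple midpoint rule, that the integral of $x^2/(\zeta^{-1}e^{x^2} \mp 1)$ approaches $\zeta\sqrt{\pi}/4$ when the parameter is small, which is the step that leads to the dilute-gas result.

```python
import math

H = 6.62607015e-34      # Planck constant, J s
K_B = 1.380649e-23      # Boltzmann constant, J/K


def thermal_wavelength(mass_kg, temperature_k):
    """Characteristic wavelength h / sqrt(2 pi m k T), in metres."""
    return H / math.sqrt(2.0 * math.pi * mass_kg * K_B * temperature_k)


def dilute_zeta(number_density, mass_kg, temperature_k):
    """n * lambda^3; meaningful only while the result is much less than one."""
    return number_density * thermal_wavelength(mass_kg, temperature_k) ** 3


def occupancy_integral(zeta, sign, x_max=10.0, steps=20000):
    """Midpoint-rule integral of x^2 / (exp(x^2)/zeta + sign) from 0 to x_max.

    sign = -1 for bosons, +1 for fermions.
    """
    dx = x_max / steps
    total = 0.0
    for j in range(steps):
        x = (j + 0.5) * dx
        total += x * x / (math.exp(x * x) / zeta + sign)
    return total * dx


# Assumed example: helium-4 gas at 300 K and one atmosphere.
m_he4 = 6.6464731e-27                 # kg
n_gas = 101325.0 / (K_B * 300.0)      # ideal-gas number density, m^-3
zeta_he = dilute_zeta(n_gas, m_he4, 300.0)
print(zeta_he)

# Small-parameter limit of the integral, for both kinds of particle.
small = 1.0e-3
limit = small * math.sqrt(math.pi) / 4.0
print(occupancy_integral(small, -1) / limit, occupancy_integral(small, +1) / limit)
```

Both printed ratios should lie close to one, the boson value slightly above and the fermion value slightly below.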

Cold dense ideal gas 

For larger values of $\zeta$, bosons and fermions behave very differently
indeed, and require separate discussion. Their divergences from
high-temperature behaviour are called by the oddly chosen name of
degeneration.

Bose-Einstein degeneration 

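We have seen that $\zeta$ for a gas of bosons cannot be more than one;
we consider first the limiting situation where $\zeta = 1$. Then the numerical
density is given by

$$n_l = 4\pi \left(\frac{2mkT}{h^2}\right)^{3/2} \int_0^\infty \frac{x^2\, dx}{e^{x^2} - 1}$$

The integral is quite tractable by the device used in the section on
black body radiation.

$$\int_0^\infty \frac{x^2\, dx}{e^{x^2} - 1} = \int_0^\infty \frac{x^2 e^{-x^2}\, dx}{1 - e^{-x^2}} = \sum_{n=1}^\infty \int_0^\infty x^2 e^{-nx^2}\, dx = \frac{\sqrt{\pi}}{4} \sum_{n=1}^\infty n^{-3/2} = \frac{\sqrt{\pi}}{4}\, \zeta(\tfrac{3}{2})$$

where $\zeta(\tfrac{3}{2})$ is Riemann's $\zeta$-function and equals 2.612.
Thus

$$n_l = 2.612 \left(\frac{2\pi mkT}{h^2}\right)^{3/2}$$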


This result should alarm us, since $\zeta$ is supposed to be determined
by $n$ and $T$ and it now appears that $\zeta$ cannot be determined at all if
$n > n_l$. The reason is that the sum over energy states which determines
$n$ is no longer properly represented by the integral.
Consider the sum

$$\bar{N} = \sum_{i=0}^\infty \frac{1}{\zeta^{-1} e^{\alpha_i/kT} - 1}$$

This can take values as large as we please, as $\zeta$ tends to $\exp(\alpha_0/kT)$
from below. For simplicity, let us take $\alpha_0 = 0$, shifting the zero of the
energy scale from the base of potential energy to the energy of the
single-particle ground state. Then the total number $n_0$ of particles
in this ground state is

$$n_0 = (\zeta^{-1} - 1)^{-1}$$

and if we choose $\zeta^{-1} = 1 + (\beta N)^{-1}$, $0 < \beta < 1$, then $n_0 = \beta N$
and we have a fraction $\beta$ of all the particles present lying down dead
in the ground state.
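The value of $\alpha_1$ is now

$$\alpha_1 = \frac{3h^2}{4m}\, V^{-2/3}$$

so

$$\frac{\alpha_1}{kT} = \frac{3h^2}{4mkT}\, V^{-2/3} \approx 9\,(n_l V)^{-2/3}$$

and the total number of particles in the first excited state is

$$n_1 = \frac{1}{\zeta^{-1} e^{\alpha_1/kT} - 1} \approx \frac{1}{(\beta N)^{-1} + 9\,(n_l V)^{-2/3}}$$

$$= \tfrac{1}{9}\,(n_l V)^{2/3} \quad \text{for large } N$$

since $n_l V$ is of the same order as $N$.

Thus $n_1 \sim N^{2/3} \ll N$, and moreover $n_1$ is already well enough represented
by its value for $\zeta = 1$.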

We see, then, that if the number $n$ of particles per unit volume is
given, there is a critical temperature $T_c$ defined by

$$n = n_l(T_c)$$

and that for $T < T_c$ we have $n_l(T)$ particles 'live' and distributed over
the excited states and a finite fraction, amounting to $n - n_l(T)$
particles, 'dead' in the ground state.

The mean energy of the live particles is easily calculated by the
same method used for $n_l$, but now involves the Riemann $\zeta(\tfrac{5}{2})$ which
is 1.341. The mean energy, per particle, of the live particles is then

$$\bar{\epsilon} = 0.5134 \times \tfrac{3}{2} kT$$

and $U \propto T^{5/2}$ for $T < T_c$.
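
The numbers quoted in this section can be reproduced with a few lines of code. The sketch below is illustrative only. It evaluates the Riemann function by a direct sum with an Euler-Maclaurin tail correction (a standard numerical device, not the method of the text), solves the relation between density and critical temperature for $T_c$, and gives the 'dead' fraction $1 - (T/T_c)^{3/2}$, which follows because the live density varies as $T^{3/2}$. The density of 145 kg per cubic metre, roughly that of liquid helium-4, is an assumed input; an ideal gas is of course only a crude stand-in for the liquid.

```python
import math

H = 6.62607015e-34      # Planck constant, J s
K_B = 1.380649e-23      # Boltzmann constant, J/K


def riemann_zeta(s, terms=1000):
    """Riemann zeta for s > 1: direct sum plus an Euler-Maclaurin tail."""
    n = terms
    head = sum(k ** (-s) for k in range(1, n))
    tail = n ** (1.0 - s) / (s - 1.0) + 0.5 * n ** (-s) + s * n ** (-s - 1.0) / 12.0
    return head + tail


ZETA_3_2 = riemann_zeta(1.5)
ZETA_5_2 = riemann_zeta(2.5)


def critical_temperature(number_density, mass_kg):
    """Solve n = zeta(3/2) * (2 pi m k T_c / h^2)^(3/2) for T_c, in kelvin."""
    prefactor = H ** 2 / (2.0 * math.pi * mass_kg * K_B)
    return prefactor * (number_density / ZETA_3_2) ** (2.0 / 3.0)


def dead_fraction(temperature_k, t_c):
    """Fraction of particles in the ground state below T_c (zero above it)."""
    if temperature_k >= t_c:
        return 0.0
    return 1.0 - (temperature_k / t_c) ** 1.5


# Mean energy of a live particle in units of (3/2) k T.
energy_ratio = ZETA_5_2 / ZETA_3_2

# Assumed example: an ideal gas with the atomic mass and density of liquid He-4.
m_he4 = 6.6464731e-27          # kg
n_he4 = 145.0 / m_he4          # m^-3, from an assumed density of 145 kg/m^3
t_c = critical_temperature(n_he4, m_he4)
print(ZETA_3_2, ZETA_5_2, energy_ratio, t_c)
```

With these assumed inputs the critical temperature comes out at a few kelvin.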

A good deal of labour has been devoted to calculating the be- 
haviour of the Bose-Einstein gas near and below the degeneration 
temperature, with a view to accounting for the superfluidity of liquid 
He$^4$, whose atoms are bosons. It is now evident that an explanation
of superfluidity must take account in some detail of the actual forces
of interaction between the atoms, which are not at all small in the
liquid. However, the isotope He$^3$, whose atoms are fermions, does
not form a superfluid in any conditions yet attained, so the statistics
are clearly important as well.

Fermi-Dirac degeneration 

In a gas of fermions $\zeta$ can take any value, and there is no special
physical interest in the point $\zeta = 1$. If $\zeta$ is large compared with one,
the necessary integrals can be reduced with only moderate exertion
(see Appendix, p. 142), to a power series whose first terms are

$$\int_0^\infty \frac{f(x)\, dx}{\zeta^{-1} e^{x} + 1} = \int_0^{\ln \zeta} f(x)\, dx + \frac{\pi^2}{6}\, f'(\ln \zeta) + \cdots$$

Taking the first two terms, we now obtain the numerical density

$$n = \tfrac{2}{3}\, C\, \xi^{3/2} \left[1 + \frac{\pi^2}{8} \left(\frac{kT}{\xi}\right)^2\right]$$

where

$$C = 4\pi (2m/h^2)^{3/2}$$

and

$$\xi = kT \ln \zeta$$

a factor 2 having been included since every energy level may be
occupied by two particles of oppositely directed spin (for spin $\tfrac{1}{2}$).

$$C = 3.4 \times 10^{39}\ \mathrm{gm}^{-3/2}\ \mathrm{cm}^{-6}\ \mathrm{sec}^{3}$$

for particles having the mass of an electron. (The new symbol $\xi$
appears where we should have expected $\mu$. We shall discuss this
later.)

Given $n$ and $T$, this relation can be inverted to find $\xi$ by successive
approximations.

The first approximation

$$\xi_0 = \left(\frac{3n}{2C}\right)^{2/3}$$

is the value of $\xi$ exact at $T = 0$. The second approximation (which
is as far as we can proceed without including more terms of the
original series) is

$$\xi = \xi_0 \left[1 - \frac{\pi^2}{12} \left(\frac{kT}{\xi_0}\right)^2\right]$$

The corresponding value of the internal energy per particle is

$$\bar{\epsilon} = \tfrac{3}{5}\, \xi_0 \left[1 + \frac{5\pi^2}{12} \left(\frac{kT}{\xi_0}\right)^2\right]$$

and the specific heat per particle is therefore

$$c = \frac{\pi^2}{2}\, \frac{kT}{\xi_0}\, k$$

These expressions are in any case valid only for large $\zeta$, i.e. for
$\xi \gg kT$, and we see that $\xi$ differs from $\xi_0$ only by the square of the
small quantity $kT/\xi_0$. In addition, the specific heat is reduced below
the equipartition value by a factor roughly $3kT/\xi_0$.
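
The inversion by successive approximations can be tried numerically. The sketch below is an illustration under stated assumptions: it takes the two-term density relation in the form $n = \tfrac{2}{3}C\xi^{3/2}[1 + (\pi^2/8)(kT/\xi)^2]$, writes $x$ for the ratio of the Fermi level to its zero-temperature value and $\tau$ for $kT$ divided by that zero-temperature value, and iterates $x = [1 + (\pi^2/8)(\tau/x)^2]^{-2/3}$ starting from $x = 1$. The result is compared with the second approximation $1 - (\pi^2/12)\tau^2$. The two agree only up to terms of order $\tau^4$, which the truncated series does not determine in any case.

```python
import math


def xi_second_approximation(tau):
    """Fermi level over its T = 0 value: 1 - (pi^2/12) tau^2."""
    return 1.0 - (math.pi ** 2 / 12.0) * tau ** 2


def xi_by_iteration(tau, tolerance=1e-14, max_steps=200):
    """Solve x = [1 + (pi^2/8) (tau/x)^2]^(-2/3) by repeated substitution.

    x is the Fermi level divided by its zero-temperature value and
    tau is kT divided by that same zero-temperature value.
    """
    x = 1.0  # first approximation: the T = 0 value
    for _ in range(max_steps):
        x_new = (1.0 + (math.pi ** 2 / 8.0) * (tau / x) ** 2) ** (-2.0 / 3.0)
        if abs(x_new - x) < tolerance:
            return x_new
        x = x_new
    raise RuntimeError("iteration did not converge")


def specific_heat_ratio(tau):
    """Specific heat (pi^2/2) k tau divided by the equipartition value 3k/2."""
    return (math.pi ** 2 / 2.0) * tau / 1.5


tau = 0.05
print(xi_by_iteration(tau), xi_second_approximation(tau))
print(specific_heat_ratio(tau))
```

The last function gives $(\pi^2/3)\tau$, which is the "roughly $3kT/\xi_0$" reduction factor.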

In the degenerate fermion gas, all states which lie much more than
$kT$ below the level $\xi$ are fully occupied, and all much more than $kT$
above $\xi$ are empty. By a natural analogy the region of full states is
called the Fermi sea; the surface in momentum space corresponding
to energy $\xi$ is called the Fermi surface. This surface is a sphere for
the gas of perfectly free particles.

The Fermi fluids 

There are two important classes of substance whose physical be- 
haviour approximates closely to that of the degenerate ideal gas of 
fermions. The first is the system of conduction electrons in a metal; 
the second is the nuclear fluid of which the nuclei of atoms are 
composed. In each case, the forces of interaction between the particles 
are very large: however, they apparently average out to provide a 
potential well equivalent to the box confining an ideal gas. 

This effect can be partly understood in terms of a familiar analogy. 
When a floating body is set in motion in even a non-viscous liquid,
the kinetic energy associated with a given speed is greater than the 
kinetic energy of the floating body alone, since the liquid is also set in 
motion. The detailed calculation of the energy in the liquid is 
generally difficult, but there will be a range of speeds over which the 
flow pattern is unchanging, except that the liquid flow speed is 
proportional to the speed of the body. In this range, the kinetic 
energy of the whole system remains proportional to the square of 
speed, with an effective mass of the order of half as much again or of 
double the mass of the body. 

In a similar way, the lower states of excitation of a Fermi fluid are 
well described in terms of a 'quasi-particle' which can be visualized 
as an electron or nucleon moving through the sea of neighbours, 
along with the connected flow of these neighbours 'cut away before 
and closing from behind'. Because of the correlated displacement of 
the neighbours, the quasi-particles interact with each other much less
vigorously than do the 'bare' particles in absence of the neighbours. 

Conduction of electrons in metals 

Once electrons had been identified, towards the end of the last 
century, it was possible to proceed with some confidence to construct 
theories of the optical and electrical behaviour of materials in which 
electron movements played the central role. Such theories had con- 
siderable success, but all implied a fundamental difficulty: if electrons 
can move independently of the atoms to which they are (more or less 
firmly) attached, then by the well recognized principle of equi- 
partition they should contribute to the specific heat. But, even in 
metals, Dulong and Petit's law is valid, only the displacements of the 
atoms as a whole apparently contributing. 

This difficulty disappears if the electrons in a metal constitute a 
Fermi gas which is highly degenerate at ordinary temperatures. To 
obtain a simple model we first of all abstract from the actual cry- 
stalline structure of the material by replacing the ions, which have 
released the conduction electrons, by a uniform distribution of 
positive charge; we then suppose that the effect of this charge on the 
conduction electrons is to mask their mutual repulsion and also to 
provide a potential well of uniform depth for them to move in. All 
these steps involve some falsification, but the resulting model, due to 
Sommerfeld, is surprisingly successful. 

One metal contains $3 \times 10^{22}$ atoms per cc. The value of $\xi_0$, then, is

$$\xi_0 = \left(\frac{3n}{2C}\right)^{2/3} = \left(\frac{9 \times 10^{22}}{2 \times 3.4 \times 10^{39}}\right)^{2/3}\ \mathrm{erg} \approx 6 \times 10^{-12}\ \mathrm{erg}$$

if we have one conduction electron per atom. This is about 4 eV,
whereas $kT$ at room temperature is $\tfrac{1}{40}$ eV, so these electrons will be
contributing about $\tfrac{1}{50}$ of their equipartition specific heat at room
temperature.
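
The arithmetic of this estimate can be repeated in CGS units. The sketch below is illustrative; the physical constants are standard values supplied here, and the variable names are assumptions. It evaluates the density-of-states constant $4\pi(2m/h^2)^{3/2}$ for an electron, the zero-temperature Fermi level $(3n/2C)^{2/3}$ for $3 \times 10^{22}$ electrons per cc, and the rough reduction factor $3kT$ divided by that level at room temperature.

```python
import math

# CGS constants (standard values, supplied for this illustration)
H_CGS = 6.62607015e-27        # Planck constant, erg s
M_E = 9.1093837e-28           # electron mass, g
ERG_PER_EV = 1.602176634e-12  # erg per electron volt


def density_of_states_constant(mass_g):
    """C = 4 pi (2m/h^2)^(3/2); the spin factor 2 is already included."""
    return 4.0 * math.pi * (2.0 * mass_g / H_CGS ** 2) ** 1.5


def fermi_level_at_zero(n_per_cc, mass_g=M_E):
    """Zero-temperature Fermi level (3n / 2C)^(2/3), in erg."""
    c = density_of_states_constant(mass_g)
    return (3.0 * n_per_cc / (2.0 * c)) ** (2.0 / 3.0)


c_electron = density_of_states_constant(M_E)
xi0_erg = fermi_level_at_zero(3.0e22)      # one conduction electron per atom
xi0_ev = xi0_erg / ERG_PER_EV
kt_room_ev = 1.0 / 40.0                    # kT at room temperature, eV
reduction = 3.0 * kt_room_ev / xi0_ev      # rough specific heat reduction

print(c_electron, xi0_erg, xi0_ev, reduction)
```

The Fermi level comes out at a few electron volts and the reduction factor at about two per cent, in line with the round figures in the text.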

At temperatures far below the Debye temperature, however, the
specific heat of the crystal lattice vibrations will be varying as $T^3$,
whereas that of the conduction electrons varies as $T$, so that for
$T/\theta_D$ less than about $(k\theta_D/\xi_0)^{1/2}/21$ the electron specific heat is the
greater of the two. This is verified in practice.

Extending the model 

As a result of the regular crystalline structure of a metal, electrons
with the appropriate momenta experience Bragg reflection within the
metal, and cannot propagate freely. In consequence the energy and
momentum are related much less simply than by $E = p^2/2m$. The
new relation can be represented by making $m$ a tensor function of $k$,
but fortunately we need only worry about the effect on the density
of states as a function of energy. In particular, for a thoroughly
degenerate Fermi gas, all we need know is that density and its first
few derivatives at $\xi_0$, in other words, at the surface of the Fermi sea.
Suppose that the number of distinct energy states between $E$ and
$E + dE$ is $g(E)\, dE$ before allowing for electron spin. Then each such
state may be occupied by two electrons of opposite spin, and the
true density of states is $2g(E)$. We then have the relations

$$N = 2 \int_0^\infty \frac{g(E)\, dE}{e^{(E - \xi)/kT} + 1}$$

$$U = 2 \int_0^\infty \frac{E\, g(E)\, dE}{e^{(E - \xi)/kT} + 1}$$

which can be reduced, taking $E/kT$ as variable and using the
expression on p. 142, to

$$N = 2 \int_0^{\xi} g(E)\, dE + \frac{\pi^2 (kT)^2}{3}\, g'(\xi)$$

$$U = 2 \int_0^{\xi} E\, g(E)\, dE + \frac{\pi^2 (kT)^2}{3}\, [g(\xi) + \xi\, g'(\xi)]$$

including only the first two terms of each series.
The value $\xi_0$ of $\xi$ at $T = 0$ is given by

$$N = 2 \int_0^{\xi_0} g(E)\, dE$$

so we have in the next approximation for $\xi$

$$\xi = \xi_0 - \frac{\pi^2 (kT)^2}{6}\, \frac{g'(\xi_0)}{g(\xi_0)}$$

The specific heat is then given in first approximation by

$$\frac{dU}{dT} = \tfrac{2}{3} \pi^2 k^2 T\, g(\xi_0)\, \{1 + O[(kT/\xi_0)^2]\}$$

Nuclear fluid 

The protons and neutrons of which atomic nuclei are composed are
Fermi particles of spin $\tfrac{1}{2}$. It is found that atomic nuclei have a nearly
constant density (about $3 \times 10^8$ tons per cc), and a nearly constant
binding energy per particle. The simplest view of these nuclei is then 
to treat them as drops of a homogeneous fluid, sometimes called 
Wignerite in recognition of E. P. Wigner's development of this idea, 
and, to begin with, to regard Wignerite as a degenerate Fermi gas. 

Wignerite contains equal proportions of protons and neutrons, and 
is not electrically neutral, so that large drops of it are blown apart by 
their own electric charge. This is why nuclei of mass number above 
about 240 become unstable against fission. Since protons and neu- 
trons differ in nature although they have almost the same mass, the 
exclusion principle still permits a given state of translatory motion to 
be occupied by two protons of opposite spin and two neutrons of 
opposite spin. Thus the depth of the Fermi sea, $\xi_0$, in Wignerite of
total numerical density $\rho$ is

$$\xi_0 = \frac{h^2}{8m} \left(\frac{3\rho}{2\pi}\right)^{2/3}$$

where $m$ is the neutron or proton mass.

For a spherical nucleus of mass number $A$ the radius $r$ is $r_0 A^{1/3}$,
where $r_0$ is found by electron scattering experiments to be about
$1.1 \times 10^{-13}$ cm. $\xi_0$ is proportional to $r_0^{-2}$ and, for the given value of
$r_0$, is 40 MeV. The energy, other than electrostatic, of removing a
nucleon (proton or neutron) from the average nucleus is 8 MeV.
Thus the Fermi level corresponds to a binding of this amount and 
the nuclear fluid can be represented as a degenerate Fermi gas in a 
potential well of depth 48 MeV. This picture leads to a good first 
approximation to the structure and behaviour of atomic nuclei. 
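
These figures can be checked directly. The sketch below is an illustration in CGS units; the constants are standard values supplied here and the proton mass is used for the nucleon. It takes the nucleon density of a sphere of radius $r_0 A^{1/3}$, namely $3/(4\pi r_0^3)$, and evaluates the depth of the Fermi sea as $(h^2/8m)(3\rho/2\pi)^{2/3}$, the form appropriate when each translational state holds four nucleons.

```python
import math

H_CGS = 6.62607015e-27        # Planck constant, erg s
M_NUCLEON = 1.6726e-24        # proton mass, g (used for both nucleons)
ERG_PER_MEV = 1.602176634e-6  # erg per MeV
R0 = 1.1e-13                  # radius parameter, cm


def nucleon_density(r0_cm=R0):
    """A nucleons in a sphere of radius r0 * A^(1/3): 3 / (4 pi r0^3) per cc."""
    return 3.0 / (4.0 * math.pi * r0_cm ** 3)


def fermi_depth_mev(rho, mass_g=M_NUCLEON):
    """(h^2 / 8m) (3 rho / 2 pi)^(2/3), converted from erg to MeV."""
    depth_erg = (H_CGS ** 2 / (8.0 * mass_g)) * (3.0 * rho / (2.0 * math.pi)) ** (2.0 / 3.0)
    return depth_erg / ERG_PER_MEV


rho = nucleon_density()
mass_density = rho * M_NUCLEON     # g per cc
xi0 = fermi_depth_mev(rho)         # depth of the Fermi sea, MeV
well_depth = xi0 + 8.0             # add the 8 MeV binding of the top nucleon

print(mass_density, xi0, well_depth)
```

The mass density of about $3 \times 10^{14}$ g per cc corresponds to the "3 x 10^8 tons per cc" quoted for nuclear matter.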

The first excited states of most nuclei lie so high as to have neg- 
ligible probability of thermal excitation, except in the very high 
temperatures obtained in nuclear explosions or some electrical dis- 
charges. Thermodynamic equilibrium is only approximately attain- 
able in transient conditions, but the idea of equilibrium can be useful 
even in the very short time-scale of a single nuclear reaction. 

'Temperature' in nuclear reactions 

While a good deal can be found out about the shape and charge 
distribution of atomic nuclei by ordinary spectroscopy, that is, from 
their action on the surrounding electrons, most of our information 
about their internal structure and motion comes from bombarding 
them with various fast particles and looking at what comes out. A 
good deal of what happens can be analysed in terms of a direct 
interaction between the incident particle and some one particle, or 
small group, in the nucleus, but there are many events in which the
incident particle seems to enter the nucleus and share out the whole 
of the available energy, forming a compound nucleus at a high level 
of excitation. This compound nucleus then loses one or more particles 
by a process analogous to the evaporation of molecules from a hot 
liquid drop. 

The available energy will consist of the kinetic energy of collision 
(in the centre of mass reference system) plus the binding energy, 
which is 8 MeV if the incident particle is a nucleon. If this is shared 
randomly there will be a certain average energy per nucleon which at 
once enables us to define a corresponding temperature for the 
Wignerite of the compound nucleus, and to predict the corresponding 
Maxwell-Boltzmann energy distribution for the particles which 
ultimately evaporate away. This is verified in many experiments. 

We should take note that information is being disregarded in this 
operation. In the course of a reaction of this sort, total energy and 
total angular momentum are certainly conserved. Conservation of 
angular momentum leads to certain angular correlations among 
incident and emitted particles; these we are neglecting. Conservation 
of energy leads to strong restrictions on the energy of the emitted 
particles— for example, if only two particles escape, and there is no 
radiation, the energy of one implies the energy of the other with an 
uncertainty which is no more than the uncertainty of experimental 
measurement. Thus, merely to state a temperature implies that we 
know less than in fact we do. Nevertheless, this picture has been of 
real use in the analysis of experimental results. 



HETEROGENEOUS SYSTEMS 



The physical systems for which we have so far calculated thermo- 
dynamic functions are all of the very simple sort which consist of a 
single component in a single macroscopic state. As an introduction 
to heterogeneous systems, let us consider two dissimilar metals in 
contact, where the condition of equilibrium is equality of chemical 
potential of the electrons in the two pieces of metal. 

Contact potential 

The potential energy, and with it the chemical potential, of the elec- 
trons in a piece of metal can be shifted up or down by varying the 
electrical potential of the metal. Let us take the zero of our energy 
scale to correspond to an electron at rest in air at earth potential. If 
the depth of the potential well representing the metal is W, we see 
that the chemical potential of the electrons in the metal at electrical 
potential V, with respect to the chosen zero, is 

$$\mu = \xi + eV - W$$

If now we consider two pieces of metal put into electrical contact
so that they can exchange electrons, then in thermodynamic
equilibrium

$$\mu_1 = \mu_2$$

i.e.

$$\xi_1 + eV_1 - W_1 = \xi_2 + eV_2 - W_2$$

Note that in these expressions $e$ must be taken as the physical
charge of the electron, that is, a negative quantity. Thus the situation
in the figure corresponds to a negative value of $V$.

Fig. 2. Fermi level and potential energy distribution for an electron
in and near a metal.

The potential energy $W$ is a function of the density rather than directly
of temperature; at constant pressure $\partial W/\partial T$ is small and negative.

The contact potential $V_1 - V_2$ will then depend on temperature
mainly through $\xi_2 - \xi_1$, and will vary parabolically with $T$, with a
coefficient depending on the relative rate of variation of density of
states with energy at the Fermi surface in each metal.

Notice that this effect, which is not easy to measure, is not the 
same as the ordinary thermo-electric effect; the latter is due to a 
dynamical coupling between the flow of heat and the flow of elec- 
tricity, and this cannot be properly discussed in terms of equilibrium 
theory. 1 

As usual, there is a limitation of time-scale here. We have supposed 
that the metals are free only to exchange electrons; over a very 
long period they would of course exchange atoms as well, and the 
ultimate equilibrium situation would contain two lumps of alloy (not, 
in general, of identical composition) such that the chemical poten- 
tials of the electrons, and of each sort of metal atom, were each 
equal in both lumps. Of course, the chemical potentials of the dif- 
ferent species would not in general be equal to each other. 

1 Kelvin did as a matter of fact derive correct results in thermo-electricity 
by a judicious use of equilibrium thermodynamics. His work is now rightly 
regarded as a foreshadowing of the recently developed subject of irreversible 
thermodynamics. 
This equilibrium between two pieces of solid material is a special
case of the problem of phase equilibrium. A phase is a connected 
region, containing material of uniform or continuously varying 
density and composition, bounded by a surface at which density or 
composition change discontinuously. Solid, liquid and vapour of one 
material, for example, will constitute distinct phases. In a physical 
system containing several phases, the conditions of thermodynamic 
equilibrium are uniformity, throughout the system, of temperature, 
pressure and the chemical potential of each component. We shall now 
use these conditions, with the results of previous chapters, to study 
some further equilibria of interest. 

Thermionic emission 

All metals, when heated in a vacuum, become surrounded by a cloud 
of negative charge which consists of electrons evaporated out of the 
metal surface. It is important for the design of thermionic devices to 
know how the density of this cloud depends on the metal and on the 
temperature. The distribution of electrons at some distance from the 
emitting surface depends on the particular electric fields which act 
on the system, but their concentration just outside the metal surface, 
provided no net current is flowing, is given simply by the equilibrium 
condition that the chemical potential of the electrons is the same 
inside and outside the metal surface. 

In terms of the Sommerfeld model which was introduced in the 
last chapter, we have to equate the chemical potential of a dense 
electron gas at low potential energy with that of a dilute electron gas 
at zero potential energy. The metal atoms we regarded as a fixed 
framework, and their thermal behaviour thus drops out of the 
problem. We are left with two phases, the interior and exterior of the 
metal, and one component, the electrons. The chemical potential in 
each phase is then simply the Gibbs free energy per electron. 

For the dense electron gas inside the metal, the chemical potential
lies $\xi$ above the potential energy level. With respect to the zero of
potential energy, which we choose outside the metal surface, the
chemical potential is then

$$\mu = -W + \xi = -\phi$$

and since $\xi \gg kT$, this depends little on temperature. The chemical
potential of the electrons in the metal is in fact nearly the total
energy of the most energetic electrons at absolute zero: since the
electronic specific heat is so small, the effect of entropy in reducing
the chemical potential is also small.
For the dilute electron gas outside the metal, the situation is quite
different. The Gibbs free energy is

$$G = U + PV - TS \quad \text{by definition}$$

giving

$$\mu = \tfrac{5}{2} kT - TS \quad \text{per electron}$$

i.e.

$$\mu = -kT \left[\ln v + \tfrac{3}{2} \ln \frac{2\pi mkT}{h^2} + \ln 2\right]$$

where $v$ is the volume per electron and we have used the expression
derived before for a dilute monatomic gas. The extra term $k \ln 2$ in
the entropy arises from the electron spin, whose two values give a
total number of quantum states just double the number obtained by
considering the translatory motion alone. Since $kT$ is generally much
less than $\phi$, the important variable is $v$ and the equilibrium condition
is obtained essentially by making the density of electrons outside the
metal so low that $TS = \phi$. More precisely, we have on equating the
values of $\mu$

$$2v \left(\frac{2\pi mkT}{h^2}\right)^{3/2} = e^{\phi/kT}$$

giving for the numerical density $n$ of electrons just outside the metal
surface

$$n = 2 \left(\frac{2\pi mkT}{h^2}\right)^{3/2} e^{-\phi/kT}$$

and for the pressure which these exert

$$p = 2 (kT)^{5/2} \left(\frac{2\pi m}{h^2}\right)^{3/2} e^{-\phi/kT}$$

Since the electrons are charged, the electron cloud produces an 
electric field, which can be measured, with some difficulty, to deter- 
mine n. It is easier to measure the thermionic current which can be 
drawn away from the metal, as in the ordinary diode valve, by an 
external electric field. This current saturates as the applied field 
increases, reaching a maximum density (current per unit area of 
emitting surface) given by 

$$i = A T^2 e^{-\phi/kT}$$

This is connected with the previous expressions by the following 
argument from kinetic theory. 

The number of particles striking unit area in unit time from an 
ideal gas at pressure p and temperature T is 

$$(2\pi mkT)^{-1/2}\, p$$

Thus the number of electrons falling back on to the metal surface
from outside per unit area and time, is

$$J = \frac{4\pi m k^2 T^2}{h^3}\, e^{-\phi/kT}$$

In the equilibrium situation there is no resultant current away from
the metal, so the flux of electrons away from the metal must also be $J$.

Suppose for simplicity that there is no reflection of electrons on the 
metal surface. This is reasonable since an electron of charge q at a 
distance r outside the metal surface experiences an 'image force' 
attraction $q^2/(4r^2)$ which accelerates it towards the surface, and it is
plausible that it will usually lose the kinetic energy thus acquired to 
the conduction electrons in the metal, and have to await thermal 
reactivation before it can again escape. (In view of the physical 
identity of electrons, we express the assumption more properly by 
saying that the arrival and escape of electrons are uncorrelated
events.) Then the flux of electrons away from the metal is controlled 
only by conditions in and at the surface, and is in particular indepen- 
dent of what is happening in the electron cloud outside the surface. 

An external field drawing electrons away from the surface will then 
decrease the back current of electrons condensing out of the external 
charge cloud, without affecting the outward flux, and the maximum 
current density out from the metal surface will be 



$$qJ = \frac{4\pi m q k^2}{h^3}\, T^2 e^{-\phi/kT}$$

in agreement with the empirical formula. If some incident electrons
are in fact reflected at the metal surface the value of $A$ will be
decreased below that, $4\pi m q k^2 h^{-3}$, given by this calculation: otherwise
it will depend on the mass and charge of the electron but not on the
particular emitting surface, the properties of the material entering
only in the determination of $\phi$.
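
The size of the constant and of the resulting current can be worked out in SI units. The sketch below is illustrative only. The physical constants are standard values, and the work function of 4.5 eV and temperature of 2500 K are assumed inputs chosen here, not data from the text. It evaluates the coefficient $4\pi m q k^2/h^3$, the saturation current density, and, as a cross-check on the balance-of-fluxes argument, the same current obtained from the electron pressure outside the metal.

```python
import math

H = 6.62607015e-34       # Planck constant, J s
K_B = 1.380649e-23       # Boltzmann constant, J/K
M_E = 9.1093837e-31      # electron mass, kg
Q_E = 1.602176634e-19    # magnitude of the electron charge, C


def richardson_constant():
    """A = 4 pi m q k^2 / h^3 in A m^-2 K^-2, assuming no reflection."""
    return 4.0 * math.pi * M_E * Q_E * K_B ** 2 / H ** 3


def electron_density_outside(phi_ev, temperature_k):
    """n = 2 (2 pi m k T / h^2)^(3/2) exp(-phi/kT), in m^-3."""
    kt = K_B * temperature_k
    return 2.0 * (2.0 * math.pi * M_E * kt / H ** 2) ** 1.5 * math.exp(-phi_ev * Q_E / kt)


def saturation_current_density(phi_ev, temperature_k):
    """i = A T^2 exp(-phi/kT), in A m^-2."""
    kt = K_B * temperature_k
    return richardson_constant() * temperature_k ** 2 * math.exp(-phi_ev * Q_E / kt)


def current_from_kinetic_flux(phi_ev, temperature_k):
    """q p / sqrt(2 pi m k T) with p = n k T, the balance-of-fluxes route."""
    kt = K_B * temperature_k
    pressure = electron_density_outside(phi_ev, temperature_k) * kt
    return Q_E * pressure / math.sqrt(2.0 * math.pi * M_E * kt)


a_const = richardson_constant()
i_sat = saturation_current_density(4.5, 2500.0)    # assumed phi and T
i_flux = current_from_kinetic_flux(4.5, 2500.0)
print(a_const, i_sat, i_flux)
```

The two routes to the current agree identically, since the second is just the algebra of the preceding paragraphs carried out numerically.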

The idea used in the last few paragraphs, which is essentially that 
an equilibrium situation is often profitably described in terms of the 
balance of two opposite fluxes, is often useful in statistical theory, 
especially when one wishes to proceed to discuss the non-equilibrium 
situation as well. We shall meet several other applications of it. 

The vapour pressure of a solid 

For simplicity, consider first the equilibrium between one of the
noble gases and its solid form. There are then no complications such
as arise from the internal molecular motions of other substances, and
the chemical potential of the gas has its simplest form

$$\mu = -kT \left[\ln v + \tfrac{3}{2} \ln \frac{2\pi mkT}{h^2}\right]$$



so long as the density is low enough for the gas to be nearly ideal in 
its behaviour. If that is so, the density of the solid is so high in 
comparison that the PV term in its Gibbs free energy can be safely 
neglected, that is, we calculate only the Helmholtz free energy. 

This was not worked out above, but it is an easy exercise for the 
reader to proceed either directly from the partition function or, from 
the internal energy which we have worked out, via the specific heat 
and entropy, to obtain the integral which represents the free energy. 
This needs to be computed numerically for intermediate temperatures,
but we can write down good approximations for low and high
temperatures, obtaining for $\mu$, with an Einstein model and
characteristic temperature $E$

$$\mu = -Q + \tfrac{3}{2} kE + \begin{cases} -3kT\, e^{-E/T}, & T \ll E \\ 3kT \ln(E/T), & T \gg E \end{cases}$$

and with a Debye model and characteristic temperature $D$

$$\mu = -Q + \tfrac{9}{8} kD + \begin{cases} -\tfrac{\pi^4}{5}\, kT\, (T/D)^3, & T \ll D \\ 3kT \ln(D/T) - kT, & T \gg D \end{cases}$$

where $Q$ is the energy of cohesion per molecule (in this case, per
atom) and is further considered below.

Vapour pressures are well enough fitted for most purposes by
using the Einstein model and the high-temperature form for the free
energy. Equating the two expressions for $\mu$ in vapour and solid we
obtain

$$p = \frac{(2\pi m)^{3/2}\, \nu^3}{(kT)^{1/2}}\, e^{-(Q - 3h\nu/2)/kT}$$

where $h\nu = kE$.

We must consider the meaning of Q in more detail. Its definition 
is that the potential energy of a piece of solid, if all its N atoms were 
at rest in their equilibrium positions, would be —NQ compared with 
the energy of these N atoms when all widely separated from each 
other. It is not the same as the latent heat of evaporation, to which 
there are other contributions. First of all, each atom in the solid has
a base energy 3kE/2 above the energy it would have if it could be
reduced to rest, due to the zero-point motion; secondly, in its thermal
oscillation it has an additional mean potential energy of 3kT/2,
provided the temperature T is much greater than E. Thus the internal
energy of evaporation is Q - (3kE/2) - (3kT/2), while the total latent
heat of evaporation, which includes the work pΔv = kT done by the
vapour, is L = Q - (3kE/2) - kT/2.

To verify that our working is correct we may compare the result 
with the corresponding expression deduced from Clapeyron's purely 
thermodynamic relation 

$$\frac{dp}{dT} = \frac{L}{T\,\Delta v}$$

giving the dependence of equilibrium pressure on temperature for
two phases with a heat of transformation L and corresponding
volume change Δv. Under the same assumptions as were made at the
start of the statistical treatment, Δv, the volume of vapour, is equal
to kT/p for each molecule, and L is $L_0 - kT/2$ (= $L_0 + p\Delta v - 3kT/2$),
where $L_0 = Q - 3kE/2$, so

$$\frac{1}{p}\frac{dp}{dT} = \frac{L_0 - kT/2}{kT^2}$$

and

$$\ln\frac{p}{p_0} = \frac{L_0}{kT_0} - \frac{L_0}{kT} + \tfrac{1}{2}\ln\frac{T_0}{T}$$

so

$$p = p_0\sqrt{\frac{T_0}{T}}\; e^{L_0/kT_0}\, e^{-L_0/kT}$$

giving the same dependence on T as the previous formula. The
advantage of the statistical formula is to relate the arbitrary $p_0$, $T_0$ to
the independently measurable quantities m, ν.
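The agreement between the statistical formula and Clapeyron's relation can also be checked numerically. The following Python sketch differentiates the statistical vapour pressure by a central difference and compares the slope with L/(TΔv), taking $L = L_0 - kT/2$ and Δv = kT/p as in the text. The function names and the input values of m, ν and $L_0$ are hypothetical, chosen only for illustration.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K


def vapour_pressure(T, m, nu, L0):
    """Statistical formula p = (2 pi m)^(3/2) nu^3 (kT)^(-1/2) exp(-L0/kT)."""
    kT = K_B * T
    return (2.0 * math.pi * m) ** 1.5 * nu ** 3 / math.sqrt(kT) * math.exp(-L0 / kT)


def clapeyron_slope(T, m, nu, L0):
    """Right-hand side L/(T dv), with L = L0 - kT/2 and dv = kT/p per molecule."""
    kT = K_B * T
    p = vapour_pressure(T, m, nu, L0)
    return (L0 - 0.5 * kT) * p / (T * kT)


# Hypothetical inputs (mass in kg, frequency in Hz, energy in J, temperature in K).
m, nu, L0, T = 6.6e-26, 1.0e12, 1.3e-20, 80.0
h = 1.0e-3  # temperature step for the central difference, in K

numerical_slope = (vapour_pressure(T + h, m, nu, L0)
                   - vapour_pressure(T - h, m, nu, L0)) / (2.0 * h)
analytic_slope = clapeyron_slope(T, m, nu, L0)
print(numerical_slope, analytic_slope)
```

The two printed slopes should agree to within the truncation error of the finite difference.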

In general, if $\mu_c$ is the chemical potential of the condensed phase
the equilibrium vapour pressure will be

$$p = kT\left(\frac{2\pi mkT}{h^2}\right)^{3/2} e^{(\mu_c - \Delta\mu)/kT}$$

where Δμ is the excess of the vapour phase chemical potential over
the simple expression given above. This excess will generally consist
of several distinguishable parts. One of these, which will be considered
later on, will arise from molecular interactions, corresponding
to the deviation of the vapour, at higher pressures, from the ideal
gas condition. Others arise from the fact that various degrees of
freedom, such as rotation, internal vibration, or electron spin, of the
molecule have different ranges of movement in the condensed and
vapour phases. Let us separate the contributions $\mu'_c$, $\Delta\mu'$ to $\mu_c$ and
Δμ respectively due to these degrees of freedom. Then

$$e^{(\mu'_c - \Delta\mu')/kT} = \frac{Z_v}{Z_c}$$

where $Z_v$ and $Z_c$ are the canonical partition functions per molecule
in the vapour and condensed phase respectively for the degrees of
freedom concerned.

The simplest example is the vapour of an alkali metal. An atom of
such a metal has a resultant electron spin angular momentum of
h/4π, which can have two orientations with the same energy in the
vapour. In the metal, the electrons couple to give zero net angular
momentum, and so, one single state. Hence, as in the thermionic
evaporation of electrons themselves, we have $Z_v/Z_c = 2$, and the
vapour pressure is twice what one would calculate without allowing
for the electron spin.

Again, a polyatomic molecule, such as that of alcohol, may rotate
freely in the vapour phase, but be restricted to a narrow range of
angular oscillation in the solid or liquid. For methyl alcohol, the
corresponding $Z_v/Z_c$ appears to be about 20.¹

Evaporation and condensation 

It is instructive to see how the same vapour pressure equation can be 
obtained from a kinetic model in which we calculate the rates of 
condensation and of evaporation and consider the equilibrium pres- 
sure at a given temperature as determined by the equality of these 
two rates. As before, we simplify by neglecting the possibility of 
reflection for the first round of discussion. 

The rate of condensation per unit area is again given by

$$J = (2\pi mkT)^{-1/2}\, p$$

The rate of evaporation is very hard to calculate with a decent 
degree of rigour, since that would involve a complete discussion of 
the lattice dynamics of a crystal at the free surface. This has never 
been done. To obtain a tractable problem we at once adopt the Ein- 
stein model, and truncate the question of how energy is transferred 
among the surface molecules. 

A molecule constrained to harmonic oscillation in one dimension
will, because of the even density of its states in energy, have a
probability exp(-W/kT) of having a total energy of oscillation of
W or more, at temperature T. If we suppose that the sharing of
energy among the surface molecules is so rapid that the energy of
a given molecule fluctuates widely inside its period of oscillation, the
energy distribution for a surface molecule which happens to be
moving outwards will be determined by the oscillator potential in
which it is moving, regardless of the possibility of escape which is
opening before it. It will then move outwards ν times per second with
a probability exp(-W/kT) on each occasion of having more than
the energy W needed for escape. The probability per unit time that it
should escape is therefore ν exp(-W/kT). One is tempted to put W
down as the internal heat of vaporization, but this is a little too easy.
Besides, it gives the wrong answer, so we must think harder.

¹ This is calculated from the observed vapour pressure and probably
includes a contribution from a large change in the vibrations associated
with the O-H bonds.

Fig. 3. Variation of potential energy of a surface molecule with
displacement normal to the surface: a, along the normal through
its central position; b, along an off-centre normal. (Horizontal axis:
normal displacement.)

Detailed study of the geometry of evaporation from a crystal leads
to a series of interesting complications which we cut through with the
assumption that all molecules evaporate from a typical situation,
such that the energy required to remove a molecule from its equilibrium
position in this situation is equal to $L_0$. However, the molecules
vibrate in the plane of the surface as well as normal to it, and a molecule
which has some lateral displacement already has a corresponding
potential energy, so that the further energy which it requires for
escape is reduced below $L_0$. In fact, a molecule displaced r against a
force which makes it oscillate with frequency ν has a resulting
potential energy $2\pi^2\nu^2mr^2$, and the further energy it needs in order
to evaporate is $L_0 - 2\pi^2\nu^2mr^2$.

The probability that a molecule should have a sideways displacement
between r and r+dr will be proportional to the area 2πr dr of
the ring in which it lies and to the Boltzmann factor
exp[$-2\pi^2\nu^2mr^2/kT$], so must be

$$\frac{2\pi r\, e^{-2\pi^2\nu^2mr^2/kT}\, dr}{\int_0^\infty 2\pi r\, e^{-2\pi^2\nu^2mr^2/kT}\, dr}$$

since the probability summed over all r must be unity. Of course, the
harmonic force law breaks down for r nearly equal to the molecular
spacing, but unless kT is very large this only affects regions where the
integrand is already negligible. The integration is easy, giving for this
probability

$$\frac{2\pi\nu^2m}{kT}\, e^{-2\pi^2\nu^2mr^2/kT}\; 2\pi r\, dr$$

Now the probability of escape per unit time, in the direction normal
to the surface, for a molecule with sideways displacement r is

$$\nu\, e^{-L_0/kT}\, e^{+2\pi^2\nu^2mr^2/kT}$$

so the total probability of escape per unit time for a given molecule is

$$\frac{2\pi\nu^3m}{kT}\, e^{-L_0/kT} \int 2\pi r\, dr$$

where ∫2πr dr is just the area A of surface occupied by the molecule.
Since there are 1/A molecules on unit area of surface, the rate of
evaporation per unit area is

$$\frac{2\pi\nu^3m}{kT}\, e^{-L_0/kT}$$

and equating this to the condensation rate

$$(2\pi mkT)^{-1/2}\, p$$

we obtain for the equilibrium vapour pressure

$$p = \frac{(2\pi m)^{3/2}\,\nu^3}{(kT)^{1/2}}\, e^{-L_0/kT}$$

an expression identical with that we derived before, using equilibrium
statistics. As in the discussion of the thermionic current, the kinetic 
argument depends on much more detailed assumptions about the 
process than does the argument from equilibrium statistical mechan- 
ics. However, the kinetic argument enables us to estimate the rate of 
condensation or evaporation when the pressure is maintained above 
or below its equilibrium value, provided we know the surface 
temperature. 
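The use of the kinetic argument away from equilibrium can be sketched in a few lines of Python. The sketch assumes, as in the first round of discussion, that no molecules are reflected, and that the vapour striking the surface has the same temperature T as the surface itself. The function names and numerical inputs are hypothetical.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K


def condensation_rate(p, T, m):
    """Molecules striking (and, by assumption, sticking to) unit area per second."""
    return p / math.sqrt(2.0 * math.pi * m * K_B * T)


def evaporation_rate(T, m, nu, L0):
    """Molecules leaving unit area per second: (2 pi nu^3 m / kT) exp(-L0/kT)."""
    kT = K_B * T
    return 2.0 * math.pi * nu ** 3 * m / kT * math.exp(-L0 / kT)


def equilibrium_pressure(T, m, nu, L0):
    """Pressure at which evaporation and condensation balance."""
    return evaporation_rate(T, m, nu, L0) * math.sqrt(2.0 * math.pi * m * K_B * T)


def net_evaporation_rate(p, T, m, nu, L0):
    """Positive when the surface loses molecules, negative when it gains them."""
    return evaporation_rate(T, m, nu, L0) - condensation_rate(p, T, m)


# Hypothetical inputs (mass in kg, frequency in Hz, energy in J, temperature in K).
m, nu, L0, T = 6.6e-26, 1.0e12, 1.3e-20, 80.0
p_eq = equilibrium_pressure(T, m, nu, L0)
print(p_eq, net_evaporation_rate(0.5 * p_eq, T, m, nu, L0))
```

With the pressure held at half its equilibrium value the net loss is half the full evaporation rate; above the equilibrium pressure the sign reverses and the surface grows.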



SOME PROBLEMS 
BASED ON LINEAR LATTICES 



A further important group of soluble problems is concerned with the 
statistics of linear arrays. They are most directly approached through 
an attempt to put more realism into the model of a rubber molecule 
discussed in an earlier chapter (pp. 27-29). To start with, we extend 
the treatment of the earlier model. 

The 'ideal rubber' was dealt with before by evaluating the free 
energy of the model chain with its ends fixed, and the mean tension 
in the chain deduced as the derivative of free energy with respect to 
chain length. An alternative procedure is to build a source of tension 
into the system being analysed, for example by considering a chain 
hanging vertically with a weight W on the end. Then a configuration
of the chain, of net length La, will have energy -WLa, the potential
energy of the weight, associated with it, so that the partition function
of the whole system, chain plus weight, becomes

$$Z_N(W) = \sum_L Z_N(L)\, e^{WLa/kT}$$

where L ranges from -N to +N over all integral values of the same
parity as N. Since $Z_N(L)$ is simply a binomial coefficient we have

$$Z_N(W) = \left[e^{Wa/kT} + e^{-Wa/kT}\right]^N = \left[2\cosh(Wa/kT)\right]^N$$

Note that since the chain is allowed only a discrete set of fixed lengths,
there is still no way to build the kinetic energy of thermal motion of
either chain or weight into the calculation.
To the extent that the corresponding factor of the total partition
function, in a more realistic model, may well be independent of L,
we shall expect to obtain the right equilibrium length by considering
probabilities proportional simply to $Z_N(L)\exp(WLa/kT)$.

Problem: Evaluate the expectation values of L and $L^2$, given N, W.
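A direct numerical summation gives a useful check on any answer to this problem. The Python sketch below sums the binomial weights $Z_N(L)\exp(WLa/kT)$ over the allowed L and returns the partition function with the two expectation values. The function name and the dimensionless argument x = Wa/kT are conventions introduced here, not the author's; the closed forms printed for comparison assume that the N links are statistically independent, each with mean orientation tanh x.

```python
import math


def chain_statistics(N, x):
    """Sum over L of C(N, (N+L)/2) exp(L x), with x = W a / kT.

    Returns (Z, <L>, <L^2>) for a chain of N freely jointed links.
    """
    Z = 0.0
    sum_L = 0.0
    sum_L2 = 0.0
    for n_down in range(N + 1):          # number of downward-pointing links
        L = 2 * n_down - N               # net length in units of a
        weight = math.comb(N, n_down) * math.exp(L * x)
        Z += weight
        sum_L += L * weight
        sum_L2 += L * L * weight
    return Z, sum_L / Z, sum_L2 / Z


N, x = 12, 0.3
Z, mean_L, mean_L2 = chain_statistics(N, x)
t = math.tanh(x)
print(Z, (2.0 * math.cosh(x)) ** N)          # partition function, two ways
print(mean_L, N * t)                         # mean length
print(mean_L2, N + N * (N - 1) * t * t)      # mean square length
```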

The potential energy of the system can, if we wish, be associated
with the links of the chain rather than with the weight; thus we obtain
exactly the same $Z_N(W)$ if every downward pointing link is given an
energy $E^-$ equal to -Wa and every upward pointing link an $E^+$
equal to +Wa, since reversing a link hoists W through a distance 2a.

The important addition to the model is now to associate an extra
energy with the joints of the chain to allow for a certain resistance
to bending. In order to gain freedom for further applications of the
new model we in fact introduce distinct energies $E^{++}$ associated with
the joint between two links of + type, $E^{+-}$ with two links of opposite
type, and $E^{--}$ with two links of - type.

The statistical discussion now obviously becomes more compli- 
cated, for the potential energy of the system depends not only on the 
total length of the chain but also on the number of points at which 
reversals occur. 

This extended model is effectively identical with that introduced by 
Ising, and since known by his name, for the discussion of para- 
magnetic susceptibility. Let us state the problem, then, in its abstract 
form. 

The Ising problem in one dimension 

A linear chain of N sites has its state specified when one of two
alternative states is assigned to each site. A + state has energy $E^+$,
and a - state has energy $E^-$, associated with it. Energy $E^{++}$, $E^{--}$ or
$E^{+-}$ is associated with each pair of neighbouring sites according to
the states assigned to the sites of the pair. The pairs +- and -+
have the same interaction energy $E^{+-}$.

The number of states of the whole chain for which $N^+$, the
number of + states, and $N_B$, the number of +- neighbour pairs,
are given, can be found by a direct enumeration of possibilities.
There is a very neat alternative which is worth using because it
foreshadows powerful general results in the study of systems which obey
linear mathematical equations. This approach starts by considering
the relation between the partition function for a chain of N sites and
that for N+1 sites.

Let the extra site be added at the top end of the chain, regarded as
vertically extended. (Of course, any epithet which ensures that we
know one end from the other will do.) Since the top site of the
N-chain may be either + or -, as may the added site, there are
four situations to consider:

Fig. 4. The four possible configurations of two terminal sites on
an Ising chain.

Any detailed analysis of what goes on in the lower part of the chain
leads to unprofitable complication at this stage. Instead, suppose that
the partition function $Z_N$ for the N-chain is written as

$$Z_N = Z_N^+ + Z_N^-$$

where $Z_N^+$ is the p.f. for a chain whose top site is known to be in a
+ state, $Z_N^-$ for a chain whose top site is known to be in a - state,
and consider the relation of the $Z^+$ and $Z^-$ for chains of different N.
$Z_{N+1}^+$ is a sum over the states which appear in situations a and c
above (those in which the added site is +). Correspondingly, $Z_{N+1}^-$
is obtained from situations b and d. Thus

$$Z_{N+1}^+ = Z_N^+\, e^{-(E^+ + E^{++})/kT} + Z_N^-\, e^{-(E^+ + E^{+-})/kT}$$
$$Z_{N+1}^- = Z_N^+\, e^{-(E^- + E^{+-})/kT} + Z_N^-\, e^{-(E^- + E^{--})/kT}$$

so that $Z_{N+1}^\pm$ are linear combinations of $Z_N^\pm$ with coefficients
which do not depend on N. If $Z_N^+$, $Z_N^-$ are considered as components
of a vector quantity, the above pair of equations can be written
formally

$$\begin{bmatrix} Z_{N+1}^+ \\ Z_{N+1}^- \end{bmatrix} = [M] \begin{bmatrix} Z_N^+ \\ Z_N^- \end{bmatrix}$$

where

$$[M] = \begin{bmatrix} e^{-\beta(E^+ + E^{++})} & e^{-\beta(E^+ + E^{+-})} \\ e^{-\beta(E^- + E^{+-})} & e^{-\beta(E^- + E^{--})} \end{bmatrix}$$

and β = 1/kT. The equation is now in matrix form, but for our
purpose this structure can simply be regarded as shorthand for the
preceding pair of equations. The new notation has the advantage of
directing attention to the operator [M] as much as to the Z. If we
interpret $[M]^n$ as the performance of the operation [M], n times in
succession, it is evident that

$$\begin{bmatrix} Z_{N+1}^+ \\ Z_{N+1}^- \end{bmatrix} = [M]^N \begin{bmatrix} e^{-\beta E^+} \\ e^{-\beta E^-} \end{bmatrix}$$
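The recursion is easily tested against a brute-force sum over all $2^N$ configurations of a short chain. In the Python sketch below the names `Ep`, `Em`, `Epp`, `Epm`, `Emm` stand for $E^+$, $E^-$, $E^{++}$, $E^{+-}$, $E^{--}$; these names, and the numerical values used, are illustrative choices only.

```python
import itertools
import math


def chain_energy(states, Ep, Em, Epp, Epm, Emm):
    """Total energy of one configuration; states is a sequence of +1 / -1."""
    energy = sum(Ep if s == +1 else Em for s in states)
    for s, t in zip(states, states[1:]):      # neighbouring pairs
        if s != t:
            energy += Epm
        elif s == +1:
            energy += Epp
        else:
            energy += Emm
    return energy


def z_brute_force(N, beta, Ep, Em, Epp, Epm, Emm):
    """Partition function by explicit enumeration of all 2^N states."""
    return sum(math.exp(-beta * chain_energy(c, Ep, Em, Epp, Epm, Emm))
               for c in itertools.product((+1, -1), repeat=N))


def z_recursive(N, beta, Ep, Em, Epp, Epm, Emm):
    """Partition function by adding one site at a time to the top of the chain."""
    zp, zm = math.exp(-beta * Ep), math.exp(-beta * Em)   # one-site chain
    for _ in range(N - 1):
        zp, zm = (zp * math.exp(-beta * (Ep + Epp)) + zm * math.exp(-beta * (Ep + Epm)),
                  zp * math.exp(-beta * (Em + Epm)) + zm * math.exp(-beta * (Em + Emm)))
    return zp + zm


params = dict(beta=1.0, Ep=0.3, Em=-0.2, Epp=-0.5, Epm=0.4, Emm=0.1)
print(z_brute_force(8, **params), z_recursive(8, **params))
```

The enumeration costs $2^N$ terms while the recursion costs N steps, which is the practical content of the matrix formulation.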

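Again, if we interpret a[M] as 'following the operation [M] by
multiplying both results by a', and [M/a] as the operator obtained by
dividing every element of [M] by a, it is evident that [M] = a[M/a],
so that the particular [M] which concerns us can be written

$$[M] = e^{-\frac12\beta(E^+ + E^- + E^{++} + E^{--})} \begin{bmatrix} e^{-\frac12\beta(E^+ - E^- + E^{++} - E^{--})} & e^{-\frac12\beta(E^+ - E^- + 2E^{+-} - E^{++} - E^{--})} \\ e^{-\frac12\beta(E^- - E^+ + 2E^{+-} - E^{++} - E^{--})} & e^{-\frac12\beta(E^- - E^+ + E^{--} - E^{++})} \end{bmatrix}$$

or

$$\frac{1}{a}[M] = [M'] = \begin{bmatrix} b & c\,e^{-\beta\Delta} \\ c^{-1}e^{-\beta\Delta} & b^{-1} \end{bmatrix}$$

where

$$-\ln a = \tfrac12\beta(E^+ + E^- + E^{++} + E^{--})$$
$$-\ln b = \tfrac12\beta(E^+ - E^- + E^{++} - E^{--})$$
$$-\ln c = \tfrac12\beta(E^+ - E^-)$$
$$\Delta = E^{+-} - \tfrac12(E^{++} + E^{--})$$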

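Then $[M]^N = a^N[M']^N$, and the symmetry of [M'] makes it a slightly
more tractable object than [M].

It is clear that if a pair of numbers x, y exists such that

$$[M']\begin{bmatrix} x \\ y \end{bmatrix} = m\begin{bmatrix} x \\ y \end{bmatrix}$$

then

$$[M']^N\begin{bmatrix} x \\ y \end{bmatrix} = m^N\begin{bmatrix} x \\ y \end{bmatrix}$$

Let us examine this possibility.
Evidently

$$[M']\begin{bmatrix} x \\ y \end{bmatrix} = m\begin{bmatrix} x \\ y \end{bmatrix}$$

is the same relation as

$$\begin{bmatrix} b - m & c\,e^{-\beta\Delta} \\ c^{-1}e^{-\beta\Delta} & b^{-1} - m \end{bmatrix}\begin{bmatrix} x \\ y \end{bmatrix} = 0$$

which when written out gives two distinct equations determining x/y.
If the two values of x/y coincide, the equations are consistent, and
the condition for this is

$$(b - m)(b^{-1} - m) = e^{-2\beta\Delta}$$

that is

$$\left[m - \tfrac12(b + b^{-1})\right]^2 = e^{-2\beta\Delta} + \tfrac14(b - b^{-1})^2$$

or

$$m = \tfrac12(b + b^{-1}) \pm \left[\tfrac14(b - b^{-1})^2 + e^{-2\beta\Delta}\right]^{1/2}$$

so the relation can be satisfied for two values of m and two corresponding
ratios of x to y, which are in fact given by

$$\frac{x}{y} = \frac{c\,e^{-\beta\Delta}}{m - b}$$

If we call the larger and smaller value of m, $m_1$ and $m_2$ respectively,
and call the corresponding values of x/y, $x_1/y_1$ and $x_2/y_2$ respectively,
the choice of actual values of $x_1$ and $x_2$ is arbitrary, but $y_1$ and $y_2$
are then defined. Having chosen $x_1$, $x_2$ we can then write

$$Z_1^+ = fx_1 + gx_2$$
$$Z_1^- = fy_1 + gy_2$$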

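and determine f and g. Notice that since $x_2/y_2$ is negative, whereas
$Z_1^+/Z_1^-$ is positive, f is never zero.

Then

$$\begin{bmatrix} Z_{N+1}^+ \\ Z_{N+1}^- \end{bmatrix} = [M]^N \begin{bmatrix} Z_1^+ \\ Z_1^- \end{bmatrix} = a^N\left\{ m_1^N f \begin{bmatrix} x_1 \\ y_1 \end{bmatrix} + m_2^N g \begin{bmatrix} x_2 \\ y_2 \end{bmatrix} \right\}$$

so

$$Z_{N+1} = a^N\left[m_1^N f(x_1 + y_1) + m_2^N g(x_2 + y_2)\right]$$

If we are interested, as is usually the case, only in the limiting
behaviour of the system for large N, then the fact that $m_1 > m_2$ leads
to the very simple result

$$\lim_{N\to\infty} N^{-1}\ln Z_N = \ln(am_1)$$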
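The limiting result can be illustrated by iterating the recursion for the pair $Z_N^+$, $Z_N^-$ and watching the ratio $Z_{N+1}/Z_N$ settle down to $am_1$. The Python sketch below does this, rescaling the pair at every step so that no overflow occurs for long chains. The parameter names and values are hypothetical, and `a_times_m1` simply evaluates the closed expressions for a, b, Δ and $m_1$ given in the text.

```python
import math


def a_times_m1(beta, Ep, Em, Epp, Epm, Emm):
    """Closed-form growth factor a*m1 from the definitions of a, b and Delta."""
    a = math.exp(-0.5 * beta * (Ep + Em + Epp + Emm))
    b = math.exp(-0.5 * beta * (Ep - Em + Epp - Emm))
    delta = Epm - 0.5 * (Epp + Emm)
    m1 = 0.5 * (b + 1.0 / b) + math.sqrt(0.25 * (b - 1.0 / b) ** 2
                                         + math.exp(-2.0 * beta * delta))
    return a * m1


def growth_factor(beta, Ep, Em, Epp, Epm, Emm, steps=200):
    """Ratio Z_{N+1}/Z_N after many applications of the recursion."""
    zp, zm = math.exp(-beta * Ep), math.exp(-beta * Em)
    total = zp + zm
    zp, zm = zp / total, zm / total           # rescale so that zp + zm = 1
    ratio = 1.0
    for _ in range(steps):
        new_zp = zp * math.exp(-beta * (Ep + Epp)) + zm * math.exp(-beta * (Ep + Epm))
        new_zm = zp * math.exp(-beta * (Em + Epm)) + zm * math.exp(-beta * (Em + Emm))
        ratio = new_zp + new_zm               # equals Z_{N+1}/Z_N since zp + zm = 1
        zp, zm = new_zp / ratio, new_zm / ratio
    return ratio


params = dict(beta=1.0, Ep=0.3, Em=-0.2, Epp=-0.5, Epm=0.4, Emm=0.1)
print(growth_factor(**params), a_times_m1(**params))
```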

Applications 

The Ising problem can be interpreted as representing several distinct 
physical situations, by suitable choice of the energy parameters 
appearing in it. 

(i) Ideal rubber. As indicated above, we set $E^+ = Wa$, $E^- = -Wa$,
and it is reasonable to set $E^{++} = E^{--} = 0$, $E^{+-} = \Delta$. In this
case, then

$$a = 1, \qquad b = c = e^{-Wa/kT}$$

and

$$m_1 = \cosh(Wa/kT) + \left[e^{-2\Delta/kT} + \sinh^2(Wa/kT)\right]^{1/2}$$

(ii) Magnet. If we now consider a chain of magnetic dipoles in
an external field H, permitting the dipoles only the alternative
alignments parallel or antiparallel to the field, the energy of reversing
one dipole by itself is $E^+ - E^- = 2mH$, where m is the moment of the
dipole. The physical interaction of the dipoles with each other
includes of course the effect on each of them of the magnetic field
due to its neighbours, but this is completely dominated in ferromagnetic
or antiferromagnetic substances by an electrostatic interaction
between the magnetic electrons on neighbouring atoms. This
interaction depends indirectly on whether the electron spins are
parallel or antiparallel, and the electron spin on each atom determines
its magnetic moment. A not unreasonable model for the real
magnet is then obtained by setting $E^{++} = E^{--} = -J$, $E^{+-} = J$, where
J is positive for a ferromagnet, in which neighbouring spins set
parallel at low temperatures, while it is negative for an antiferromagnet,
whose ordered state at low temperature contains antiparallel neighbours.
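For this choice of parameters the general expressions give $a = e^{J/kT}$, $b = c = e^{-mH/kT}$ and Δ = 2J, so that $m_1 = \cosh(mH/kT) + [\sinh^2(mH/kT) + e^{-4J/kT}]^{1/2}$. The Python sketch below assumes the symmetric assignment $E^+ = +mH$, $E^- = -mH$ (the text fixes only the difference) and compares the mean alignment of a dipole with the field, obtained by differentiating $\ln(am_1)$ with respect to H, against a numerical derivative. The function names and numerical values are hypothetical.

```python
import math


def log_a_m1(H, J, moment, kT):
    """ln(a m1) for E+ = +moment*H, E- = -moment*H, E++ = E-- = -J, E+- = +J."""
    x = moment * H / kT
    return J / kT + math.log(math.cosh(x)
                             + math.sqrt(math.sinh(x) ** 2 + math.exp(-4.0 * J / kT)))


def alignment(H, J, moment, kT):
    """Mean fraction of each moment aligned with the field, kT d ln(a m1)/dH / moment."""
    x = moment * H / kT
    return math.sinh(x) / math.sqrt(math.sinh(x) ** 2 + math.exp(-4.0 * J / kT))


H, J, moment, kT = 0.3, 0.5, 1.0, 1.0     # arbitrary units, for illustration
h = 1.0e-5
numerical = kT * (log_a_m1(H + h, J, moment, kT)
                  - log_a_m1(H - h, J, moment, kT)) / (2.0 * h) / moment
print(alignment(H, J, moment, kT), numerical)
print(alignment(H, 0.0, moment, kT), math.tanh(moment * H / kT))  # no coupling
```

With J = 0 the result reduces to the tanh law for independent dipoles; a positive J increases the alignment at a given field and a negative J reduces it.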
(iii) Lattice gas. The adsorption of gas molecules on a solid
surface, or of solute molecules along a linear polymer chain, gives
rise to situations which can be represented in terms of an Ising model.

Consider a long-chain molecule which provides N similar sites for
the attachment of a smaller molecule whose chemical potential is μ.
For simplicity, suppose that no other chemical species is present.
The grand partition function for the adsorbed molecules is then

$$\sum_{N^+=0}^{N} e^{N^+\mu/kT}\, q^{N^+} \sum_s g(N^+, s, N)\, e^{-(N^+E + sB)/kT}$$

where $N^+$ is the number of adsorbed molecules, q the partition function
for the motion of one molecule on a lattice site, assumed independent
of the presence of a neighbour, E the energy of an adsorbed
molecule in its equilibrium position, B the energy associated with
two molecules adsorbed on adjacent sites, s the number of such pairs,
and g the number of configurations then possible for the $N^+$ molecules.
This can be rewritten as

$$\sum_{N^+,\,s} g\; e^{-N^+(E - \mu - kT\ln q)/kT}\, e^{-sB/kT}$$

and it is evident that this is the partition function for the Ising
problem if we set

$$E^+ = E - \mu - kT\ln q$$
$$E^{++} = B$$
$$E^- = E^{+-} = E^{--} = 0$$

Here a = b and

$$-\ln a = \frac{1}{2kT}(E + B - \mu - kT\ln q)$$
$$-\ln c = \frac{1}{2kT}(E - \mu - kT\ln q)$$
$$\Delta = -\tfrac12 B$$

In this case

$$m_1 = \cosh\left[\frac{E + B - \mu - kT\ln q}{2kT}\right] + \left[e^{B/kT} + \sinh^2\left[\frac{E + B - \mu - kT\ln q}{2kT}\right]\right]^{1/2}$$

and for the very simple situation where B = 0 we have

$$a = q^{1/2}\, e^{(\mu - E)/2kT}$$
$$m_1 = 2\cosh\left[(E - \mu - kT\ln q)/2kT\right]$$
and

$$N^{-1}\ln Z_N \approx \tfrac12\ln q + \frac{\mu - E}{2kT} + \ln m_1$$

Here we are interested in $N^+$, which is evidently

$$N^+ = \frac{kT}{Z_N}\frac{\partial Z_N}{\partial\mu} = kT\,\frac{\partial\ln Z_N}{\partial\mu}$$

so

$$N^+/N = \tfrac12 + \tfrac12\tanh\left[(kT\ln q + \mu - E)/2kT\right]$$

If we consider the behaviour of $N^+/N$ at fixed temperature when μ
is varied, this runs from zero at large negative μ to unity at large
positive μ. Half cover of the adsorber is reached when μ = E - kT ln q.

When μ is expressed in terms of concentration or pressure in a
bulk phase (solution or vapour) in equilibrium with the adsorbing
surface, the corresponding expression for $N^+/N$ is known as an
adsorption isotherm. Of particular interest is the expression for μ in
terms of the pressure of an equivalent ideal gas, since this is often a
nearly correct expression in terms of the actual saturation vapour
pressure. We have

$$\mu = -kT\left[\ln\frac{kT}{p} + \tfrac32\ln\frac{2\pi mkT}{h^2} + \ln(\mathcal{R}\mathcal{V})\right]$$

where $\mathcal{R}$, $\mathcal{V}$ are factors in the partition function for the rotation and
internal vibrations of the molecule. This gives the very simple
isotherm

$$N^+/N = \tfrac12 + \tfrac12\tanh\left[\alpha(T) + \tfrac12\ln p\right]$$

In this case, with B = 0, the non-interacting molecules are distributed
entirely at random over the N adsorbing sites. For non-zero B,
there is a tendency to crowding or spacing according to the sign of B,
but the simple functional form of $Z_N$ cannot produce any phase
transition, such as is known to occur in two-dimensional systems
(surface melting).
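The isotherm for B = 0 can be put in a more familiar shape. Since ½ + ½ tanh u = 1/(1 + e^{-2u}), the expression with u = α(T) + ½ ln p is the same thing as p/(p + e^{-2α}), which is the form usually associated with the name of Langmuir. The Python sketch below checks this identity and the half-cover condition numerically; the function names and the numerical inputs are illustrative assumptions.

```python
import math


def coverage(mu, E, q, kT):
    """N+/N = 1/2 + 1/2 tanh[(kT ln q + mu - E)/2kT] for the case B = 0."""
    return 0.5 + 0.5 * math.tanh((kT * math.log(q) + mu - E) / (2.0 * kT))


def coverage_from_pressure(p, alpha):
    """The isotherm N+/N = 1/2 + 1/2 tanh[alpha + (1/2) ln p]."""
    return 0.5 + 0.5 * math.tanh(alpha + 0.5 * math.log(p))


def saturating_form(p, alpha):
    """Algebraically equivalent form p / (p + exp(-2 alpha))."""
    return p / (p + math.exp(-2.0 * alpha))


E, q, kT = -0.8, 3.0, 0.25                        # arbitrary units
mu_half = E - kT * math.log(q)                    # chemical potential for half cover
print(coverage(mu_half, E, q, kT))
for p in (0.01, 0.1, 1.0, 10.0):
    print(p, coverage_from_pressure(p, 0.7), saturating_form(p, 0.7))
```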

(iv) Competing adsorption — linear solution. If two different types 
of molecule compete for the same chain of adsorption sites, each site 
has three possible situations; occupied by a + molecule, occupied by 
a — molecule, or vacant. The method of analysis used above is 
directly applicable to this problem, but identifying the three values of 
m arising from a three-square matrix involves solving a cubic 
equation. 

The problem reduces to the case already solved if we take the
chemical potentials of both types of molecule to be comparable, but
so high that the configurations containing vacant lattice sites are
negligibly probable. The choice of parameters in the Ising problem is
evidently

$$E^+ = V^+ - \mu^+ - kT\ln q^+$$
$$E^- = V^- - \mu^- - kT\ln q^-$$

while $E^{++}$, $E^{+-}$, $E^{--}$ represent the interaction energies.

Here $V^+$, $V^-$ are energies in the equilibrium positions, corresponding
to E in the lattice gas problem. The energy
$\Delta = E^{+-} - \tfrac12(E^{++} + E^{--})$ is a measure of the energetic
preference for forming bonds between like rather than unlike neighbours.

If we consider how the distribution of molecules varies with temperature,
while $\mu^+$ and $\mu^-$ are adjusted so as to keep $N^+$ and $N^-$
constant and each equal to N/2, we see that the distribution will be
entirely random at very high temperature, and will depend on the
sign of Δ at very low temperature. If Δ is positive, the configuration
of lowest energy contains N/2 molecules of one type followed by N/2
of the other with only one dissimilar neighbour pair. The energy of
this configuration of complete segregation is

$$\frac{N}{2}(V^+ + V^-) + \left(\frac{N}{2} - 1\right)E^{++} + \left(\frac{N}{2} - 1\right)E^{--} + E^{+-}$$

If Δ is negative the configuration of lowest energy consists of an
alternating chain in which only dissimilar neighbour pairs occur.
The energy of this completely ordered configuration is

$$\frac{N}{2}(V^+ + V^-) + (N - 1)E^{+-}$$

The energy of the segregated state is higher than that of the ordered
state by

$$\left(\frac{N}{2} - 1\right)(E^{++} + E^{--}) - (N - 2)E^{+-}$$

i.e.

$$-(N - 2)\Delta$$

In the limit of large N then, Δ is the energy per molecule to be
gained by forming the segregated state from the ordered one.
Energy -Δ is gained by ordering, and if Δ itself is negative then -Δ
is frequently called the ordering energy.
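These statements about the configurations of lowest energy can be confirmed for a short chain by enumeration. The Python sketch below lists every arrangement of N/2 molecules of each type on N sites and picks out those of lowest bond energy; the site energies $V^+$, $V^-$ are omitted because they contribute the same amount to every arrangement of fixed composition. The function names and the interaction energies used are hypothetical.

```python
import itertools


def bond_energy(chain, Epp, Epm, Emm):
    """Sum of neighbour-pair energies for a chain given as a sequence of '+' / '-'."""
    energy = 0.0
    for s, t in zip(chain, chain[1:]):
        if s != t:
            energy += Epm
        elif s == '+':
            energy += Epp
        else:
            energy += Emm
    return energy


def lowest_energy_chains(N, Epp, Epm, Emm, tol=1e-12):
    """Enumerate chains with N/2 sites of each type; return (energy, list of minima)."""
    best, winners = None, []
    for plus_sites in itertools.combinations(range(N), N // 2):
        chain = ['-'] * N
        for i in plus_sites:
            chain[i] = '+'
        energy = bond_energy(chain, Epp, Epm, Emm)
        if best is None or energy < best - tol:
            best, winners = energy, [''.join(chain)]
        elif abs(energy - best) <= tol:
            winners.append(''.join(chain))
    return best, winners


# Delta = Epm - (Epp + Emm)/2 is +0.4 in the first case and -0.75 in the second.
print(lowest_energy_chains(8, Epp=-1.0, Epm=-0.5, Emm=-0.8))
print(lowest_energy_chains(8, Epp=-0.2, Epm=-0.9, Emm=-0.1))
```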

Where $E^{++}$, $E^{--}$, $E^{+-}$ are negative, a stable linear chain, a
copolymer or linear solid solution, may be formed without an adsorbing
base. The thermodynamics of this situation follow then simply by
setting $V^+ = V^- = 0$.

The Bethe method

The treatment of the one-dimensional Ising problem, given in the 
first part of this chapter, is unfortunately difficult to extend to two 
dimensions. In fact the Ising problem for a two-dimensional network
has only been solved (originally by Onsager) for the restricted situation
with $E^+ = E^-$; and no complete solution has yet been found for
any case of the Ising problem in three dimensions.

An alternative approach to the Ising problem, which gives the 
exact result in one dimension and gives rise to useful simple approxi- 
mations in two and three dimensions, is due to Bethe. There are two 
essential steps in Bethe's method. First of all, the extensive physical 
object to be studied is separated into a small system, which is de- 
scribed precisely, and an environment, whose properties are described 
by a parameter; secondly, a consistency condition permits the evalua- 
tion of this parameter in terms of the initial conditions of the 
problem. 

We have already seen, in the notation of the previous section,

$$Z_{N+1} = e^{-E^+/kT}\left(e^{-E^{++}/kT}Z_N^+ + e^{-E^{+-}/kT}Z_N^-\right) + e^{-E^-/kT}\left(e^{-E^{+-}/kT}Z_N^+ + e^{-E^{--}/kT}Z_N^-\right)$$

which we can recognize as a particular case of

$$Z_{I+II} = \sum_{C_1} e^{-[E(C_1) + F_2(C_1)]/kT}$$

where $C_1$ is a distinct state of a system I, I is coupled to an environment
II, and $F_2(C_1)$ is the free energy of the environment under the
constraint that I is in state $C_1$. In the particular application, I is the
(N+1)th site and II is the rest of the chain. Note that the energy
of interaction between I and II is incorporated in $F_2$.

If the system I contains several adjacent sites, the value of $F_2$ will
depend only on the states of those sites in I which are immediate
neighbours of sites in II. We now apply the method with a more
fruitful choice of I, taking this to consist of three typical adjacent
sites somewhere in the interior of the chain. II falls into two completely
independent parts, L and R say, and there are eight distinct
$C_1$ to consider. We examine the relative probability of these $C_1$. Since
L and R are independent we have

$$F_2(C_1) = F_L(C_1) + F_R(C_1)$$

and

$$p(C_1) = e^{-[E(C_1) + F_L(C_1) + F_R(C_1)]/kT} / Z_{I+II}$$

where $F_L(C_1)$ takes only two values $F_L(+)$ and $F_L(-)$ according as
the adjacent site in I is in state + or -. Similarly, $F_R(C_1)$ takes only
values $F_R(+)$ and $F_R(-)$.

Now suppose that L and R are both so large that

$$F_L(+) - F_L(-) = F_R(+) - F_R(-)$$

independent of the size of L and R. Then $p(C_1)$ depends only on the
state of the central site of I and the number n (0, 1 or 2) of + states on
the two outer sites. Thus

$$\tfrac12 n!(2-n)!\; p(+, n) = qz^n\, e^{-[(n+1)E^+ + nE^{++} + (2-n)E^{+-} + (2-n)E^-]/kT}$$
$$\tfrac12 n!(2-n)!\; p(-, n) = qz^n\, e^{-[nE^+ + (2-n)E^{--} + nE^{+-} + (3-n)E^-]/kT}$$

where z is the factor $e^{-[F_L(+) - F_L(-)]/kT}$ contributed by the
environment for each outer site in a + state, and q is determined so as
to normalize the set of probabilities p, by making their sum unity.

The parameter z can now be determined by the following ingenious
device. The expectation number of + states on a typical site must be

$$\sum_n p(+, n)$$

The expectation number of + neighbours of a typical site must be

$$\sum_n n\,[p(+, n) + p(-, n)]$$

Since every site has two neighbours, each of which is itself a typical
site, for these two statements to be consistent we require

$$\sum_n n\,[p(+, n) + p(-, n)] = 2\sum_n p(+, n)$$

i.e.

$$2p(-, 2) + p(-, 1) - p(+, 1) = 2p(+, 0)$$

which is a quadratic equation in z with one positive root and one
negative. The negative root fails to meet the condition that p(±, 1)
must remain positive, so z is uniquely defined.
It will be convenient to contract the notation so that

$$\tfrac12 n!(2-n)!\; p(+, n) = qz^n\, e^{-\beta(E^+ + 2E^{+-})}\, e^{-\beta n(E^{++} - E^{+-})} \equiv qz^n\delta y^n$$

and

$$\tfrac12 n!(2-n)!\; p(-, n) = qz^n\, e^{-\beta(E^- + 2E^{--})}\, e^{-\beta n(E^{+-} - E^{--})} \equiv qz^n\gamma x^n$$

where

$$y = \alpha x, \qquad \alpha = e^{-\beta(E^{++} + E^{--} - 2E^{+-})} = e^{2\beta\Delta}$$

and other factors have been incorporated in z and q.
Then the equation defining z is

$$z^2\gamma x^2 + z\gamma x - z\delta y = \delta$$

or

$$\gamma zx(1 + zx) = \delta(1 + zy)$$

The second form is not useful in solving for z, but convenient for
some other operations.
The value of z is evidently

$$z = \frac{(\delta y - \gamma x) + \left[(\delta y - \gamma x)^2 + 4\gamma\delta x^2\right]^{1/2}}{2\gamma x^2}$$

which then defines the values of the p.

The quasi-chemical equation 

A useful and elegant relation can now be had for the probabilities of
the different bond types.

A given site can only have (++) bonds radiating from it if it is in
a + state and has at least one + neighbour. The expectation number
of (++) bonds in which it shares is

$$2n^{++} = 2p(+, 2) + p(+, 1)$$

so that the probability $n^{++}$ that a given bond is of (++) type is
given by

$$n^{++} = q\delta(z^2y^2 + zy)$$

Similarly

$$n^{--} = q\gamma(1 + zx)$$

and

$$n^{+-} = q(\delta + \delta zy + \gamma z^2x^2 + \gamma zx) = 2q\delta(1 + zy)$$

Thus

$$\frac{(n^{+-})^2}{4n^{++}n^{--}} = \frac{\delta^2(1 + zy)^2}{\delta zy(1 + zy)\,\gamma(1 + zx)} = \frac{x}{y}\cdot\frac{\delta(1 + zy)}{\gamma zx(1 + zx)} = \frac{x}{y} = e^{-2\beta\Delta}$$

The formal resemblance between this result and the mass action law
for a reaction A₂ + B₂ ⇌ 2AB has led to the former being named the
quasi-chemical equation.

Problem: The energy associated with a lattice site in any particular 
configuration is the energy of the state at that site plus half the energy 
of the bonds radiating from it. (The other half belongs to the sites at 
the other ends of these bonds.) Calculate the expectation value of the 
energy for a typical site, and show that this agrees with the value 
obtained from the previously calculated partition function. 

Since the Bethe method already gives the exact result for the prob- 
lem to which we have applied it, there is no point in generalizing to a 
larger system I than has been used. Moreover, the same correct 
result would have been obtained by postulating the quasi-chemical 
equation as the starting point of the calculation. However, the use of 
the Bethe method in one dimension involves one fact and one 
assumption, neither of which extends to two or three dimensions. 
The fact is that L and R are completely independent of each other 
because they are completely disconnected by the interposed group of 
3 sites. The assumption (which we know is right because of our prior
matrix calculation) is that, for large L, $F_L(+)/F_L(-)$ becomes
independent of the size of L. In a system in which long-range order
existed, this ratio would still alternate, as the number of sites in L
was even or odd, even for large L.

Thus in using the Bethe method in two- or three-dimensional 
lattices, the possibility of the correct type of long-range order must 
be built in at the start, by distinguishing the sub-lattices involved. 
This is easy enough to do correctly, knowing the natural phenomena,
and we return to this later (pp. 106-117). The other, worse, difficulty is
that a compact system I in the interior of the lattice is surrounded by
a single connected system II so that the introduction of the same factor
$z^n$ for n surface sites, independent of their arrangement, is in general
wrong. It turns out that the assumption of the quasi-chemical equation
is equivalent to the Bethe method only when I consists of a
central site and the set of its immediate neighbours. The Bethe
method can sometimes be improved by a more elaborate choice of I,
but the calculations rapidly become involved.



MOLECULAR INTERACTIONS IN GASES



We have seen in the last chapter how to deal with particles having a
particularly simple interaction, described by a single energy constant,
in the particular problem of linear adsorption. The Bethe, or the
quasi-chemical, approximation gives an approach to the behaviour
of the corresponding 'lattice gas' in two or three dimensions, and this
problem will be reconsidered later (pp. 101-106).

The interactions of atoms and molecules cannot in general be so
summarily described. Since the existence and relations of the various
states of matter depend on these interactions we must consider them
now with more care. The simplest to describe, but by no means the
simplest to calculate with, is the Coulomb interaction between
charged particles. We shall leave this for later consideration
(pp. 135-140).

Neutral atoms and non-polar molecules, then, have the following
interactions. At long range, there is a rapidly fluctuating mutual
polarization giving rise to an attractive potential energy varying as the
inverse sixth power of the separation; this is the potential of the
dispersion or van der Waals forces. At short range there may be a
covalent binding energy varying roughly as an exponential function
of the distance, exp(-r/a), with a decay length a usually of one or
two angstroms. At very short range there is a strong repulsion,
varying usually as a rapid exponential or a high inverse power (ten- or
twelve-fold) of the distance. This repulsion arises from the Pauli
exclusion principle which prevents the interpenetration of the
wave functions representing the inner electron shells of the atoms.
In the hydrogen molecule, where alone these inner shells are absent,
the repulsion is merely the Coulomb repulsion of the hydrogen
nuclei.

A difficulty which is always present in principle is that the inter- 
action of two bodies must be affected by the presence of a third, since 
the third acting on the others deforms each of them and so changes 
their action on each other. 'Many-body' interactions of this sort are 
usually disregarded for the operation of the van der Waals attractions, 
with good justification, and for the overlap repulsion, with rather 
less justification: they are never negligible for covalent interactions. 

Covalent binding shows the particular characteristic of saturation. 
For example, two hydrogen atoms bind strongly to form a molecule. 
A third atom interacts much less strongly with the molecule. Two 
molecules interact with each other even less strongly. 

Let us consider, then, two extreme models. One represents in a 
simplified way the dissociation of a covalently bound diatomic mole- 
cule. We assume a strong interaction between atoms which are bound 
in a molecule, none between atoms and molecules, and, rather 
inconsistently, none between free atoms. What we really require is 
just enough interaction to make recombination possible, but not 
enough to disturb the equation of state. 

The other extreme model is one in which the potential energy of 
interaction of any two particles is quite independent of the presence 
of any others. This is quite a good model for the monatomic rare 
gases, and for many other substances under conditions where no 
dissociation occurs. 

Dissociation equilibrium in a gas 

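Suppose that in a volume V we have enough material to constitute
N molecules of a compound C, which is partly dissociated by a
reaction whose chemical equation is

$$\gamma\,\mathrm{C} = \alpha\,\mathrm{A} + \beta\,\mathrm{B}$$

An example might be

$$2\,\mathrm{NH_3} = \mathrm{N_2} + 3\,\mathrm{H_2}$$

If we have in fact $N_C = N - \gamma M$ molecules of C present we shall
have $N_A = \alpha M$ molecules of A and $N_B = \beta M$ of B. Treating the species
A, B and C as interpenetrating, independent hot dilute ideal gases,
the partition function for the system becomes

$$Z(N,M,V,T) = \frac{1}{(N-\gamma M)!}\left(\frac{Vq_C}{\Lambda_C^3}\right)^{N-\gamma M} \times \frac{1}{(\alpha M)!}\left(\frac{Vq_A}{\Lambda_A^3}\right)^{\alpha M} \times \frac{1}{(\beta M)!}\left(\frac{Vq_B}{\Lambda_B^3}\right)^{\beta M}$$

and the corresponding Helmholtz free energy F is given by

$$-\frac{F}{kT} = (N-\gamma M)\left(1+\ln\frac{Vq_C}{(N-\gamma M)\Lambda_C^3}\right) + \alpha M\left(1+\ln\frac{Vq_A}{\alpha M\Lambda_A^3}\right) + \beta M\left(1+\ln\frac{Vq_B}{\beta M\Lambda_B^3}\right)$$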

so that the equilibrium condition for given N, V, T is obtained by
maximizing this expression with respect to M to yield

$$\gamma\ln\frac{Vq_C}{(N-\gamma M)\Lambda_C^3} = \alpha\ln\frac{Vq_A}{\alpha M\Lambda_A^3} + \beta\ln\frac{Vq_B}{\beta M\Lambda_B^3}$$

Problem: Verify the derivation, and that the result is equivalent to
$\gamma\mu_C = \alpha\mu_A + \beta\mu_B$, which is the thermodynamic equilibrium condition.
The thermodynamic condition is true, of course, for reactants in any
physical state.

Notice that if the energy values used in calculating $q_A$ and $q_B$ are
reckoned from the state where A and B are at rest at a distance from
other objects, the energy zero for $q_C$ is at $-E_C$, where $\gamma E_C$ is the
reduction of energy on assembling $\alpha$ A and $\beta$ B into $\gamma$ C molecules
likewise at rest. Thus $q_C = q'_C \exp(E_C/kT)$, where $q'_C$ is reckoned on a
similar basis to $q_A$ and $q_B$.

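The equilibrium condition in another form is

$$\frac{N_C^{\gamma}}{N_A^{\alpha}N_B^{\beta}} = V^{\gamma-\alpha-\beta}\,\frac{\Lambda_A^{3\alpha}\Lambda_B^{3\beta}}{\Lambda_C^{3\gamma}}\,\frac{q_C^{\gamma}}{q_A^{\alpha}q_B^{\beta}} = K(T), \quad\text{an 'equilibrium' constant;}$$

or

$$N_C^{\gamma} = K N_A^{\alpha} N_B^{\beta}$$

which expresses the law of mass action for this particular process.
Alternatively

$$N - \gamma M = \left(K\alpha^{\alpha}\beta^{\beta}M^{\alpha+\beta}\right)^{1/\gamma}$$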

For the dissociation of a diatomic gas, or for the example of
ammonia, quoted above, $(\alpha+\beta)/\gamma = 2$, and the equilibrium condition
is a quadratic. For the general case, numerical solution is easy. Near
the limits of complete dissociation and complete association, we can
iterate, as follows.

For complete dissociation, $N_C = 0$, i.e.

$$N = \gamma M_0$$

and the next approximation for nearly complete dissociation is to
put this $M_0$ into the other side of the equilibrium condition, to
obtain

$$\gamma M_1 = N - \left(K\alpha^{\alpha}\beta^{\beta}M_0^{\alpha+\beta}\right)^{1/\gamma}$$

$$\gamma M_2 = N - \left(K\alpha^{\alpha}\beta^{\beta}M_1^{\alpha+\beta}\right)^{1/\gamma}$$

and so on.

For complete association $M_0 = 0$, and the next approximation is to
take $N_C = N$ in the equilibrium condition, so that

$$K\alpha^{\alpha}\beta^{\beta}M_1^{\alpha+\beta} = N^{\gamma}$$

$$K\alpha^{\alpha}\beta^{\beta}M_2^{\alpha+\beta} = (N-\gamma M_1)^{\gamma}$$

and so on.

The temperature dependence of K is completely dominated by
$\exp(\gamma E_C/kT)$ in most circumstances. It retains a volume dependence
through the term $V^{\gamma-\alpha-\beta}$, so that at given temperature one will
always have complete dissociation at low enough density, but it is
easy to see that for a dissociation energy of several electron volts
dissociation will be nearly complete only at the density of interstellar space.
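The two iterations described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, N and K are treated as plain numbers in arbitrary units (in the text K carries a volume factor), and the example reaction AB = A + B with $\alpha = \beta = \gamma = 1$ is chosen because its quadratic can be solved directly for comparison.

```python
def iterate_dissociated(N, K, alpha, beta, gamma, steps=200):
    """Iterate gamma*M_{k+1} = N - (K a^a b^b M_k^(a+b))^(1/gamma), starting from N_C = 0."""
    c = K * alpha**alpha * beta**beta
    M = N / gamma  # complete dissociation: N = gamma * M_0
    for _ in range(steps):
        M = (N - (c * M ** (alpha + beta)) ** (1.0 / gamma)) / gamma
    return M


def iterate_associated(N, K, alpha, beta, gamma, steps=200):
    """Iterate K a^a b^b M_{k+1}^(a+b) = (N - gamma*M_k)^gamma, starting from M_0 = 0."""
    c = K * alpha**alpha * beta**beta
    M = 0.0  # complete association
    for _ in range(steps):
        M = ((N - gamma * M) ** gamma / c) ** (1.0 / (alpha + beta))
    return M


def residual(M, N, K, alpha, beta, gamma):
    """N_C^gamma - K N_A^alpha N_B^beta, which vanishes at equilibrium."""
    return (N - gamma * M) ** gamma - K * (alpha * M) ** alpha * (beta * M) ** beta


if __name__ == "__main__":
    # Illustrative numbers only: small K favours dissociation, large K association.
    print(iterate_dissociated(1.0, 0.1, 1, 1, 1))
    print(iterate_associated(1.0, 100.0, 1, 1, 1))
```

Each scheme converges only near its own limit: for this example the first needs $2KM < 1$ at the solution and the second needs $2KM > 1$, which is why the text offers both.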

Atom interchange 

One interesting example of a reaction equilibrium is the exchange

$$2\,\mathrm{AB} = \mathrm{AA} + \mathrm{BB}$$

corresponding to $\gamma = 2$, $\alpha = \beta = 1$, so that at equilibrium

$$\frac{N_{AB}^2}{N_{AA}N_{BB}} = \left(\frac{\Lambda_{AA}\Lambda_{BB}}{\Lambda_{AB}^2}\right)^{3}\frac{q_{AB}^2}{q_{AA}q_{BB}}$$

with the temperature dependent right-hand side dominated by
$\exp[(E_{AA}+E_{BB}-2E_{AB})/kT]$, where $E_{\alpha\beta}$ is the energy of the molecule $\alpha\beta$
at rest measured from the energy of the separated atoms at rest. The
energy $-\tfrac12(E_{AA}+E_{BB}-2E_{AB})$ is the energy per molecule of AB
required to make AB from AA and BB.

Bimolecular association 

Another special case, $\gamma = 1$, $\alpha = 2$, $\beta = 0$, is of particular interest. At the
high association limit it is relevant to equilibria such as $\mathrm{H_2 = H + H}$
and at the high dissociation limit it gives a first-order model of the
equilibrium $\mathrm{(H_2)_2 = H_2 + H_2}$ between pairs and single molecules in
gas not far from condensation. The values of $E_C$ for these two situations differ by a factor of over 100.

If we put $n_C = N_C/V$ and $n_A = N_A/V$, the equilibrium condition is

$$\frac{n_C}{n_A^2} = K$$

where

$$K = \frac{\Lambda_A^6}{\Lambda_C^3}\,\frac{q_C}{q_A^2}$$

Writing $n = N/V$, we have $n_C = n - n_A/2$, so

$$n - \tfrac12 n_A = K n_A^2 \quad\text{and}\quad n_A = \frac{1}{4K}\left(\sqrt{1+16nK}-1\right)$$

Which way to expand the square root depends on the value of K.
If that is very small ($16nK \ll 1$)

$$n_A = 2n - 8n^2K + 64n^3K^2 + \cdots$$

$$n_C = 4n^2K - 32n^3K^2 + \cdots$$

and

$$p = (n_A+n_C)kT \quad\text{(assuming ideal behaviour)}$$

$$p = 2nkT\,(1 - 2Kn + 16K^2n^2 + \cdots)$$



The development of the pressure as a power series in the numerical 
density is a well-known representation for the equation of state of an 
imperfect gas. The natural variable to use is the total number of 
molecules of monomer present, which is m=2n, so we have for our 
model 

$$p = mkT\,(1 - Km + 4K^2m^2 + \cdots)$$

Comparing this with the usual notation

$$p = mkT\,(1 + bm + cm^2 + \cdots)$$

we see that the model predicts

$$b < 0, \qquad c = 4b^2$$
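The expansion can be checked numerically against the exact root of the quadratic. The sketch below is illustrative only: the function names are hypothetical, and the densities and K are plain numbers in arbitrary consistent units.

```python
import math


def monomer_density(n, K):
    """Exact positive root of n - n_A/2 = K n_A^2."""
    return (math.sqrt(1.0 + 16.0 * n * K) - 1.0) / (4.0 * K)


def pressure_over_kT_exact(n, K):
    """p/kT = n_A + n_C for an ideal mixture of monomers and pairs."""
    n_A = monomer_density(n, K)
    n_C = n - 0.5 * n_A
    return n_A + n_C


def pressure_over_kT_series(n, K):
    """Truncated series p/kT = m (1 - K m + 4 K^2 m^2) with m = 2n."""
    m = 2.0 * n
    return m * (1.0 - K * m + 4.0 * K**2 * m**2)


if __name__ == "__main__":
    n, K = 0.01, 0.1  # 16 n K = 0.016, well inside the small-K regime
    print(pressure_over_kT_exact(n, K), pressure_over_kT_series(n, K))
```

The two values agree to the order of the first neglected term, which confirms the coefficients $b = -K$ and $c = 4K^2$.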

The Van der Waals equation of state, rearranged into the form of 
the virial expansion, agrees in giving a positive third coefficient, c, 
but no necessary relation between the third and second coefficients. 
Indeed, when the experimentally determined equations of state of 
various gases are expressed in virial form it appears that the second 
coefficient b is normally negative at low and positive at high tem- 
peratures. 

Evidently, the use of a mathematical model founded on saturating
forces, to fit a situation where the forces certainly do not saturate, is
decidedly optimistic. Something better has to be found.

Independently interacting molecules: Van der Waals equation

We now consider molecular interactions such that two molecules, i
and j say, have a potential energy of mutual interaction $\phi_{ij}$ which
depends on their separation and relative orientation, but not on the
presence of other neighbouring molecules. For a collection of N such
molecules numbered by subscripts 1, 2, ..., N, the total potential
energy is just the sum of the two body potentials

$$\mathcal{V} = \sum_{1 \le i < j \le N} \phi_{ij}$$

care being taken to write the expression so that no particle interacts
with itself and no term is included twice over.

For atoms, atomic ions, and sufficiently spherical molecules, $\phi_{ij}$
depends only on the distance $r_{ij}$ between molecular centres, and we
may write

$$\phi_{ij} = \phi(r_{ij})$$

where $\phi$ is the same function for all pairs if all molecules belong to
the same chemical species. Thus for a pure substance the partition
function for N molecules moving in a volume V becomes

$$Z_N = \frac{q^N}{N!\,\Lambda^{3N}}\int\!\cdots\!\int_V d\mathbf{r}_1\cdots d\mathbf{r}_N\, e^{-\mathcal{V}/kT}$$

on the assumption that the quantum length $\Lambda$ $(=\sqrt{h^2/2\pi mkT})$ is
small compared with the characteristic distances which specify $\phi$. $\mathcal{V}$
should also include the sum $\sum_i w(\mathbf{r}_i)$ which gives the potential energy
of interaction of the system with the walls of its container, but this only
gives rise to surface energy terms in the thermodynamic potentials,
so we neglect it for the moment and take the effect of the walls to be
simply to restrain every position $\mathbf{r}_i$ within V. If the molecules do not
interact, the multiple integral is just $V^N$, and the expression reproduces the partition function for an ideal gas with q the partition
function for the rotation and internal motion of one molecule.

A typical shape for $\phi$ is shown in Fig. 5. An effective diameter for
the molecule is defined by d. This is obviously dependent on T, but
the variation will be slight if $\phi$ rises sufficiently steeply near $r = d$.
Surrounding each molecular centre, then, is a volume $b_0 = 4\pi d^3/3$
which is seldom penetrated by the centres of other molecules. At
any instant, then, any one molecule is effectively excluded from a
volume $(N-1)b_0$ out of the total V (and from a relatively negligible
volume of the order of $V^{2/3}d$ spread over the walls).

Fig. 5 Potential energy of interaction of two molecules as a function of the separation r of their centres.

Having thus eliminated the configurations in which the molecular
interaction energies would be very large, we seek an estimate of the
effect of attraction. The sum $\mathcal{V}$ is in fact half the sum over all molecules of each molecule's interaction with all its fellows, so it is N/2
times the average potential energy of interaction of one molecule
with the rest. This average is taken over molecules for one configuration of the whole system. We now require the average of
$\exp(-\beta\mathcal{V})$, where $\beta = 1/kT$, over all configurations, and estimate this by taking
instead $\exp[-\beta(\text{average of }\mathcal{V})]$.

This step appears not to be valid unless the fluctuations of $\mathcal{V}$ are
small compared with kT. The last condition can only be met if the
attractive part of $\phi$ is very small, and if that is so, the attractive
interaction can only be important if it has a long range to compensate.

This estimate is quantitatively good only for an unrealistic type of
interaction, but leads to some reasonable deductions. The average
value of $\mathcal{V}$ is now N/2 times the average interaction of one molecule
with the others, which is itself proportional to the density of these
others, and this we take as N/V. Thus the average $\mathcal{V}$ is $-N^2A/2V$
and the partition function

$$Z_N = (N!)^{-1}\Lambda^{-3N}q^N(V-Nb_0)^N e^{N^2A/2VkT}$$

The corresponding Helmholtz free energy F is then

$$F = -kT\ln Z_N = -NkT\left[1+\ln\frac{q(V-Nb_0)}{N\Lambda^3}\right] - \frac{N^2A}{2V}$$

and

$$p = -\frac{\partial F}{\partial V} = \frac{NkT}{V-Nb_0} - \frac{N^2A}{2V^2}$$

i.e.

$$\left(p+\frac{N^2A}{2V^2}\right)(V-Nb_0) = NkT$$

which is the well-known equation of state of Van der Waals.
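As a rough check, the equation of state just obtained can be rearranged into virial form by expanding $1/(1 - nb_0)$ in the number density $n = N/V$, giving second and third coefficients $b_0 - A/2kT$ and $b_0^2$. The sketch below compares the two forms numerically; the function names and the reduced units ($kT = 1$, $b_0 = 1$) are assumptions made for illustration.

```python
def vdw_pressure(N, V, kT, A, b0):
    """Pressure from (p + N^2 A / 2V^2)(V - N b0) = N kT."""
    return N * kT / (V - N * b0) - N**2 * A / (2.0 * V**2)


def vdw_virial_coefficients(kT, A, b0):
    """Second and third coefficients of p = n kT (1 + b n + c n^2 + ...)."""
    b = b0 - A / (2.0 * kT)
    c = b0**2
    return b, c


def vdw_virial_pressure(N, V, kT, A, b0):
    """Pressure from the virial series truncated after the third coefficient."""
    n = N / V
    b, c = vdw_virial_coefficients(kT, A, b0)
    return n * kT * (1.0 + b * n + c * n**2)


if __name__ == "__main__":
    # Illustrative reduced units; low density so that the truncated series is adequate.
    print(vdw_pressure(1.0, 100.0, 1.0, 2.0, 1.0))
    print(vdw_virial_pressure(1.0, 100.0, 1.0, 2.0, 1.0))
```

In this form the second coefficient changes sign at $kT = A/2b_0$, negative below and positive above, while the third stays positive.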

At high temperatures, the presence of one molecule will hardly
affect the density in its neighbourhood, so A should be given by

$$A = -\int_d^{\infty} 4\pi r^2\,\phi(r)\,dr$$

At lower temperatures, molecules will tend to linger in each other's
attractive field, and A will increase as

$$A = -\int_d^{\infty} 4\pi r^2\,\phi(r)\,e^{-\phi(r)/kT}\,dr$$



This increase describes quite well the variation of the second virial 
coefficient of a number of gases with temperature. In view of the loose 
averaging procedure discussed above, one should not expect the 
expression to give a good absolute value. 

The virial expansion 

We now outline a method which is rigorous in principle, but impossibly laborious in practice, for deriving the equation of state in
the form of the virial expansion, that is, expressing pV as a power
series in $V^{-1}$. If the grand partition function were expanded in such
a series, the virial expansion would be had by taking the logarithm
and eliminating $\mu$. The power series which represents the logarithm
of a given power series is known, though the expression of its general
term is unpleasantly complicated.

We start with the familiar expression for the canonical partition 
function for a classical system of independently interacting identical 
molecules, suppressing, for simplicity, the factors referring to rota- 
tion and internal motion of the molecules. The expression is directly 
valid for a system of rare gas atoms. 

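$$Z(N,V,T) = \frac{1}{N!\,\Lambda^{3N}}\int\!\cdots\!\int d\mathbf{r}_1\cdots d\mathbf{r}_N\, e^{-\mathcal{V}/kT}$$

where

$$\mathcal{V} = \sum_{i<j}\phi(r_{ij}), \qquad r_{ij} = |\mathbf{r}_i - \mathbf{r}_j|$$

The exponential is thus a product of factors $\exp(-\phi_{ij}/kT)$, each
containing the positions of only two molecules and different from
unity only when these molecules are fairly close together. This
property suggests an ingenious substitution due to Mayer

$$e^{-\phi_{ij}/kT} = 1 + f_{ij}$$

Then

$$e^{-\mathcal{V}/kT} = \prod_{i<j}(1+f_{ij}) = 1 + \sum f_{ij} + \sum f_{ij}f_{kl} + \cdots$$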
Restrictions on the subscripts, which become complicated to 
write down for later terms, have to express the requirements that 
each interaction, of molecule i with molecule j, say, may be named at 
most once in any term, and that a particular term must not be 
counted more than once. A particular molecule, i, say, may be named 
as taking part in several interactions in one term. 

Each factor $f_{ij}$ is only significant when $r_{ij}$ is within the effective
range of molecular force. Thus for the simple term which consists of
one $f_{ij}$ alone, integration over all $\mathbf{r}_i$ and $\mathbf{r}_j$ within V gives a factor of
V arising from the free range of $\mathbf{r}_i$, with $\mathbf{r}_j - \mathbf{r}_i$ fixed, while variation
of $\mathbf{r}_j - \mathbf{r}_i$ makes a constant contribution independent of V.

The unit first term of the expansion, then, gives $V^N$ on integration,
while the $\tfrac12 N(N-1)$ terms containing one interaction each give
$V^{N-1}$ times a constant. In a general term, the appearance of common
subscripts in factors such as $f_{ij}$, $f_{ik}$ implies a linking of molecules in
such a way that the term is only significant when all members of a
'cluster' are near each other. If a term referring to n molecules contains m clusters, it yields on integration $V^{N+m-n}$ multiplied by constant factors, and m cannot be more than $\tfrac12 n$. For example, for three
molecules, m = 1, and for four molecules, m = 1 or 2, while for four
molecules in one cluster the number of interactions may range from
three to six.

The contribution to Z arising from a particular term depends on 
the number and size of the clusters, and the pattern of their internal 
connections, but not on the particular molecules named. Thus many 
terms give the same contribution on integration. The numbers and 
sizes of clusters are constrained to constitute a partition of N. 

Since the evaluation of Z involves a sum over partitions of N, 
there is good reason to evade this problem, as usual, by proceeding 
at once to the grand partition function 

$$\Xi = \sum_N e^{\mu N/kT}\,Z(N,V,T)$$

The further development is elegant but elaborate.¹ However we can
without difficulty find the first terms in Z, obtaining

$$Z = \frac{1}{N!}\left(\frac{V}{\Lambda^3}\right)^{N} + \frac{1}{2(N-2)!}\left(\frac{V}{\Lambda^3}\right)^{N-1}\frac{1}{\Lambda^3}\int_0^{\infty}\left[e^{-\phi(r)/kT}-1\right]4\pi r^2\,dr + \cdots$$

$$Z = Z_{\mathrm{ideal}}(N,V,T)\left[1 + \frac{N(N-1)}{V}\,2\pi\int_0^{\infty}\left[e^{-\phi(r)/kT}-1\right]r^2\,dr + \cdots\right]$$

The series in the brackets has the extremely alarming property
that the size of its second term depends not only on N/V but on N
as well. Thus for a constant physical condition defined by T and N/V,
we may reverse the balance of the series by increasing the size of the
system considered.

Caution can be overdone. If the expansion in $V^{-1}$ converges at
all, we have the correct first terms, and for V, as it must be, much
larger than

$$2\pi\int_0^{\infty}\left[e^{-\phi(r)/kT}-1\right]r^2\,dr$$

we may choose N to make the second term small so that

$$F = F_{\mathrm{ideal}} - kT\,\frac{N(N-1)}{V}\,2\pi\int_0^{\infty}\left[e^{-\phi(r)/kT}-1\right]r^2\,dr \ldots$$

and

$$\frac{p}{kT} = \frac{N}{V} - \left(\frac{N}{V}\right)^{2} 2\pi\int_0^{\infty}\left[e^{-\phi(r)/kT}-1\right]r^2\,dr \ldots$$

which is physically sensible. The question whether the series are well-behaved lies well beyond our present scope. The result for this virial
coefficient is in fact exact.

¹ We refer to the treatments in Hill's Statistical mechanics (see Further Reading, p. 146) and in Theory of linear graphs by Uhlenbeck and Ford (Studies in
statistical mechanics, vol. 1, Ed. de Boer and Uhlenbeck, North Holland
Publishing Co., 1962).
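The coefficient of $(N/V)^2$ in the pressure series above, $-2\pi\int_0^\infty [e^{-\phi/kT}-1]\,r^2\,dr$, is easy to evaluate numerically for any model potential. The sketch below does this by the midpoint rule. The square-well potential, the function names and the reduced units are assumptions chosen for illustration, because that potential has a simple closed form to check against; it is not a potential used in the text.

```python
import math


def second_virial(phi, kT, r_max, steps=30000):
    """-2*pi * integral_0^r_max [exp(-phi(r)/kT) - 1] r^2 dr by the midpoint rule."""
    h = r_max / steps
    total = 0.0
    for i in range(steps):
        r = (i + 0.5) * h
        total += (math.exp(-phi(r) / kT) - 1.0) * r * r
    return -2.0 * math.pi * total * h


def square_well(d, width, depth):
    """Hypothetical model: hard core below d, constant well of given depth out to width*d."""
    def phi(r):
        if r < d:
            return math.inf
        if r < width * d:
            return -depth
        return 0.0
    return phi


def square_well_closed_form(d, width, depth, kT):
    """Closed form of the same integral for the square well, used only as a check."""
    core = 2.0 * math.pi * d**3 / 3.0
    return core * (1.0 - (width**3 - 1.0) * (math.exp(depth / kT) - 1.0))


if __name__ == "__main__":
    phi = square_well(1.0, 1.5, 1.0)
    print(second_virial(phi, 2.0, 3.0), square_well_closed_form(1.0, 1.5, 1.0, 2.0))
```

The integration range only needs to extend past the point where the potential has become negligible, here `r_max = 3` for a well ending at 1.5.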

Problem: For what condition on $\phi$ would the second virial coefficient derived here coincide with that obtained from the van der
Waals equation derived above?

Physical clusters 

For some purposes it is important to estimate the very small con- 
centration of relatively large molecular clusters present in a gas in 
equilibrium. When a saturated vapour is suddenly cooled, for ex- 
ample by adiabatic expansion, detectable droplets are most readily 
formed by the growth of the largest clusters previously present. 
Although in a natural atmosphere such clusters are generally stabil- 
ized by the presence of solutes such as salt, it is interesting to estimate 
what would happen in a pure substance. To do this, we extend the 
treatment of dissociation given above. 

Suppose that at any moment the vapour of a pure substance in a
given volume V can be regarded as made up of single molecules and
clusters of two, three and more. Let there be $N_j$ clusters containing
the same number j of molecules, and let the partition function for the
rotation and internal motion of such a cluster be $q_j$. The canonical
partition function for N molecules is then

$$Z(N,V,T) = \sum_{\text{partitions}}\ \prod_j \frac{1}{N_j!}\left(\frac{Vq_j}{\Lambda_j^3}\right)^{N_j}$$

where the sum runs over all partitions of N into groups of various j
so that $\sum_j jN_j = N$.

The sorting out of this sum is not straightforward, so we simplify
the calculation by specifying the chemical potential $\mu$ instead of N.
Now

$$\Xi(\mu,V,T) = \sum_N Z(N,V,T)\,e^{\mu N/kT} = \sum_N\ \sum_{\text{partitions}}\ \prod_j \frac{1}{N_j!}\left(\frac{Vq_j\,e^{j\mu/kT}}{\Lambda_j^3}\right)^{N_j}$$


Since any arbitrarily selected set of $N_j$ now gives an admissible
member of the sum, we have no partition problem and can write

$$\Xi = \prod_j \sum_{N_j} \frac{1}{N_j!}\left(\frac{Vq_j\,e^{j\mu/kT}}{\Lambda_j^3}\right)^{N_j} = \exp\left[\sum_j Vq_j\Lambda_j^{-3}e^{j\mu/kT}\right] = \exp(pV/kT)$$

so

$$\frac{p}{kT} = \sum_j q_j\Lambda_j^{-3}e^{j\mu/kT}$$

The expectation number of clusters with j members is the sum of
each possible value of $N_j$ times the relative weight of those terms
which correspond to this $N_j$. This is

$$\bar{N}_j = q_j\,\frac{\partial\ln\Xi}{\partial q_j} = Vq_j\Lambda_j^{-3}e^{j\mu/kT}$$

each set of clusters contributing to the pressure, then, as an ideal gas
of $\bar{N}_j$ molecules.

For j = 1 this is restating our neglect of passing interactions between molecules, for the equation relates $\bar{N}_1$ to $\mu$ for an ideal monomolecular gas.

Further progress depends on taking a specific form for the $q_j$. It
is reasonable that a large cluster in the vapour should have the
structure of whichever condensed phase equilibrates with vapour at
the specified temperature. Let the chemical potential in that phase be
$\mu_c$ at the given temperature and pressure p. Then the Gibbs free
energy of a group of j molecules in the interior of the condensed
phase is $j\mu_c$, and the corresponding Helmholtz free energy is $j\mu_c - pv_j$
where $v_j$ is the volume of the group. The free energy of the cluster is
greater than this by its surface free energy, which we write as $S_j\sigma_j$:
$S_j$ may be defined to be $4\pi r_j^2$ where $v_j = 4\pi r_j^3/3$; $\sigma_j$ is likely to vary
substantially with j for small j.

Problem: Disregarding all thermal motions, estimate $\sigma_j$ for small
values of j for a substance with nearest-neighbour bonds only and a
face-centred cubic crystal structure. Count broken bonds and be
careful about factors of 2.

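The expectation $\bar{N}_j$ may now be expressed as

$$\bar{N}_j = V\Lambda_j^{-3}\,\mathcal{R}_j\exp\left[\frac{j(\mu-\mu_c)+pv_j-S_j\sigma_j}{kT}\right]$$

or, since $\Lambda_j = j^{-1/2}\Lambda_1$,

$$\bar{N}_j = j^{3/2}\,V\Lambda_1^{-3}\,\mathcal{R}_j\exp\left[\frac{j(\mu-\mu_c)+pv_j-S_j\sigma_j}{kT}\right]$$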



In these expressions, the factor $\mathcal{R}_j$ is introduced to allow for the
free rotation of the cluster in the vapour, compared with the restrictions on its rotation when buried in the condensed phase. This factor
will be greatest when the condensed phase is solid, since three degrees
of freedom which are then transverse waves, with wavelength of the
order of twice the diameter of the cluster, become degrees of free
rotation when the cluster is released into the vapour.

The transverse wave of wavelength $4r_j$ has frequency $c/4r_j$, where
c is the appropriate speed of sound, and the corresponding partition
function at high enough temperature is $(4r_jkT/hc)^3$ for three degrees of
freedom.

An accurate estimate of the rotational partition function is beyond
the scope of this book (see, for example, Rushbrooke), but its order
of magnitude is well enough given if we take the p.f. for a linear
molecule, which has two rotational degrees of freedom, and raise it
to the three-halves power, $(2IkT/\hbar^2)^{3/2}$. This gives then

$$\mathcal{R}_j = \left(\frac{hc}{4r_jkT}\right)^{3}\left(\frac{2IkT}{\hbar^2}\right)^{3/2} = \left(\frac{\pi^2Ic^2}{2r_j^2kT}\right)^{3/2}$$

A fair guess for a liquid may be made if we imagine a hypodermic 
syringe with a needle of molecular dimensions immersed in the bulk 
liquid. Liquid may be drawn in reversibly with zero expenditure of 
work, provided the liquid just wets the wall of the syringe, and we 
suppose a drop of molecules extruded into the vapour region by 
doing an amount of work determined by the pressure change and 
the surface energy. 

While the drop remains attached by a small part of its surface to
the needle of the syringe, those modes of motion which will become
the free rotations correspond to oscillations controlled by the surface
tension of the drop, with frequency $(\sigma_j/m_j)^{1/2}$, where $\sigma_j$ is the surface
tension and $m_j$ the drop mass. The partition function is $(kT/h)^3(m_j/\sigma_j)^{3/2}$
and correspondingly

$$\mathcal{R}_j = \left(\frac{h}{kT}\right)^{3}\left(\frac{\sigma_j}{m_j}\right)^{3/2}\left(\frac{2IkT}{\hbar^2}\right)^{3/2}$$

Problem: Estimate $\mathcal{R}_j$ for ice and water clusters with j = 10 and
100, at 0°C.

Although $\mathcal{R}_j$ is not negligible, its interpretation is still doubtful.
After all, one could have defined $\mathcal{R}_j$ to be unity, and subsumed the
rotation effects into the variation of $\sigma_j$. In any event, $\mathcal{R}_j$ varies as a
low power of T, and the exponential will dominate it completely.
We neglect it in the next paragraphs.

The condition of equilibrium between condensed phase and vapour
is $\mu = \mu_c$. The vapour can of course exist by itself at $\mu < \mu_c$. States of
the vapour for which $\mu > \mu_c$ represent supersaturation and are at
most metastable. However, in supersaturated vapour, small clusters
are still more likely to evaporate than to grow.

The Gibbs free energy of a cluster is $j\mu_c + S_j\sigma_j$, so the chemical
potential in it is $\mu_c + (S_{j+1}\sigma_{j+1} - S_j\sigma_j)$. If $\sigma_{j+1} = \sigma_j = \sigma$, the chemical
potential is

$$\mu_c + \sigma\frac{dS}{dj} = \mu_c + \frac{2\sigma v}{r}$$

where v is the molecular volume in the condensed phase.

For any $\mu$ larger than $\mu_c$, then, there exists an r such that $\mu = \mu_c + 2\sigma v/r$, giving the radius of the cluster which is by itself in equilibrium at this $\mu$. Smaller clusters are more likely to evaporate than
grow; larger ones are more likely to grow beyond limit (i.e. in practice, till they fall out of the supersaturated vapour region).
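To get a feeling for the sizes involved, the equilibrium radius and the corresponding number of molecules can be evaluated directly. The sketch below is illustrative only. It assumes an ideal vapour, so that $\mu - \mu_c = kT\ln(p/p_{\mathrm{sat}})$, a relation not stated in the text, and it uses rough round values for water near 0°C (surface tension about 0.0756 N/m, molecular volume about $3.0\times10^{-29}$ m³). The function names are hypothetical.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K


def critical_radius(sigma, v, delta_mu):
    """Radius r satisfying mu = mu_c + 2*sigma*v/r, for delta_mu = mu - mu_c > 0."""
    return 2.0 * sigma * v / delta_mu


def critical_size(sigma, v, delta_mu):
    """Number of molecules j = (4*pi/3) r^3 / v in a cluster of the critical radius."""
    r = critical_radius(sigma, v, delta_mu)
    return 4.0 * math.pi * r**3 / (3.0 * v)


def delta_mu_ideal_vapour(T, saturation_ratio):
    """Assumption: ideal vapour, so mu - mu_c = kT ln(p / p_sat)."""
    return K_B * T * math.log(saturation_ratio)


if __name__ == "__main__":
    # Rough illustrative inputs for water at 273.15 K and fourfold supersaturation.
    dm = delta_mu_ideal_vapour(273.15, 4.0)
    print(critical_radius(0.0756, 3.0e-29, dm), critical_size(0.0756, 3.0e-29, dm))
```

Because the radius varies inversely with $\mu - \mu_c$, the critical number of molecules grows as the inverse cube of the supersaturation, so small clusters become critical only at quite high supersaturation.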

In a supersaturated vapour it is strictly nonsense to discuss
thermodynamic equilibrium conditions, but a kinetic discussion,
which lies outside our present scope, shows that at low supersaturations the concept of a definite $\mu$ is valid and the 'equilibrium'
expression for $N_j/N_1$ remains reasonable so long as j is well below its
value, $(4\pi/3v)\,[2\sigma v/(\mu-\mu_c)]^3$, for the equilibrium cluster.

It is important to note that even if the problems of evaluating $\mathcal{R}_j$
and of defining $\sigma_j$ were resolved, we have not constructed a correct
account of the imperfect gas. The reason lies in the difficulty of
making a clear distinction between a cluster and a momentarily close
group of molecules. If a distinction is imposed in dynamical terms, the
cluster needs to have a bounded energy (negative, for the usual choice
of energy zero). This is incompatible with a well-defined temperature,
so $q_j$ must be redefined. This can of course be done, but one loses the
simplicity which makes the approximate model instructive.

A very simply related discussion can be applied to find the equili- 
brium density of bubbles in a liquid. Here again, a bubble in a super- 
heated liquid must pass a certain size before it reaches the 'runaway' 
state of continuing growth. The absence of suitable nuclei is re- 
sponsible for the phenomenon of 'bumping' in the boiling of clean 
liquids. Two familiar devices for detecting the tracks of high energy 
particles, the Wilson cloud chamber and the Glaser bubble chamber, 
depend on the nucleation of an unstable fluid. In the cloud chamber, 
drops are nucleated by ions, which stabilize the condensed phase 
with its high dielectric constant: in the bubble chamber, the bubbles 
are nucleated by intense local heating associated with the short tracks 
of slow electrons. 



ORDERING ON A LATTICE 



The lattice gas 

Solute molecules in a crystal lattice or adsorbed molecules on a 
crystal surface, if they are sufficiently tightly bound that each mole- 
cule vibrates many times about one position of mechanical equili- 
brium before making a diffusive jump to another similar position, 
give rise to a simplified statistical problem in that the integral over 
all configurations reduces to a sum over a finite set of crystal 
arrangements. 

Mutual exclusion effects appear naturally in the condition that 
only one molecule may occupy one site (or connected group of 
neighbouring sites) at one time. The simplest situation arises when 
one molecule occupies one site and interacts only, and independently, 
with its nearest neighbours. As was seen in the previous chapter, this 
is a case of the Ising problem. For a structure in more than one 
dimension, an approximate method of solution must be employed. 

Consider a crystalline structure of N molecular sites. Each site is 
surrounded by z equidistant nearest neighbours, which may or may 
not be nearest neighbours to each other according to the particular 
structure type. In the body-centred cubic lattice, z is 8 and no 
nearest neighbour of a given molecule is nearest neighbour to another 
member of the group: in the face-centred cubic lattice, z is 12 and 
each nearest neighbour of a given molecule is nearest neighbour to 
four other members of the group. 

The total number of 'bonds' joining nearest neighbours in this
structure is $\tfrac12 zN$, for each molecule is the terminus of z bonds and
each bond has two terminal molecules. To fix our ideas, consider a
solid solution of $N_A$ molecules of species A with $N_B$ molecules of
species B, where $N_A + N_B = N$. These molecules can be arranged in a
vast number of configurations in which the numbers of AA, AB and
BB nearest neighbour bonds vary widely.

For example, if all the A molecules are aggregated into a crystal
region of pure A, the number of AA bonds (disregarding the surface
of the A region) is $\tfrac12 zN_A$. If on the other hand $N_A$ is small enough
($< N_B$, for the body-centred cubic structure), the A atoms may be
dispersed so that the number $n_{AA}$ of AA bonds is zero. Since in
general the energy of an AA bond differs from that of an AB, and
in turn from that of a BB bond, different values of $n_{AA}$ correspond to
different energies of the crystal even for constant $N_A$.

Since each AA bond involves two A atoms, its formation entails
the loss of two potential AB bonds, so the number of AB bonds is
necessarily $zN_A - 2n_{AA}$. The number of BB bonds must then evidently
be $\tfrac12 zN - zN_A + n_{AA}$. Thus the numbers of AA, AB and BB bonds are
determined when N, $N_A$ and any one of $n_{AA}$, $n_{AB}$ or $n_{BB}$ are known.
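These counting relations are easy to confirm on a small example. The sketch below counts bonds directly on a periodic square lattice (z = 4) with a random arrangement of A and B. The lattice type, its size and the function names are assumptions for illustration; periodic boundaries are used so that there are no surface bonds, and the side must be at least 3 so that each site has four distinct neighbours.

```python
import random


def count_bonds(grid):
    """Count AA, AB and BB nearest-neighbour bonds on a periodic square lattice (z = 4)."""
    L = len(grid)
    counts = {"AA": 0, "AB": 0, "BB": 0}
    for i in range(L):
        for j in range(L):
            # Visit only the right and down neighbours so each bond is counted once.
            for di, dj in ((0, 1), (1, 0)):
                pair = "".join(sorted(grid[i][j] + grid[(i + di) % L][(j + dj) % L]))
                counts[pair] += 1
    return counts


def random_grid(L, n_A, seed=0):
    """L x L grid with n_A sites labelled 'A' and the rest 'B', shuffled reproducibly."""
    rng = random.Random(seed)
    sites = ["A"] * n_A + ["B"] * (L * L - n_A)
    rng.shuffle(sites)
    return [sites[k * L:(k + 1) * L] for k in range(L)]


if __name__ == "__main__":
    z, L, n_A = 4, 8, 20
    c = count_bonds(random_grid(L, n_A))
    print(c, z * n_A - 2 * c["AA"], z * L * L // 2 - z * n_A + c["AA"])
```

The printed AB and BB counts should match the two expressions computed from $n_{AA}$ for any arrangement, not only for random ones.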

As a rule, different configurations will give rise to different normal 
modes of vibration for the structure, but the mechanical problem of 
determining these modes is very intractable. The simplest procedure 
will be to adopt an Einstein model, in which each molecule is sup- 
posed to vibrate independently in the average potential field deter- 
mined by its neighbours (we ignore anharmonic effects in the poten- 
tial field arising from its neighbours at rest in their equilibrium 
positions). This will be considered independently of the configuration 
problem (see pp. 105-106). 

The potential energy of the system at rest, and a reasonable
approximation to the molecular oscillations, will be determined by
the numbers of bonds present of the three distinct types, and so by
N, $N_A$, $n_{AA}$ in particular. We therefore require to know the number
of distinct configurations of the system, given these three numbers.
This has been directly obtained for the one-dimensional problem, but
is not so easily estimated for two and three dimensions.

It is clear that if $n_{AA}$ is not specified, the number of distinct
arrangements of $N_A$ A molecules and $N_B$ B molecules is $N!/(N_A!\,N_B!)$
for a given set of N sites.¹ This result, while not immediately
helpful, provides a useful constraint on any approximation.

¹ Students may rightly worry about the fact that the N sites are determined
by the interaction of the molecules and not magically provided in advance. A
cluster of N connected sites of a given crystal structure may take an enormous
variety of shapes and attitudes, but those which deviate far from a convex
shape, which is (roughly) the shape of minimum surface area (and, exactly, the
shape of minimum surface free energy), have high surface free energy and correspondingly low probability of occurrence in thermodynamic equilibrium.
So long as we only wish to discuss bulk properties, then, variations of shape of
the group of N sites may be neglected.

Another helpful preliminary result comes from considering a
wholly random distribution of molecules, such as may arise if A and
B differ only by an isotopic substitution. The expectation number of
A neighbours of a given site is then $zN_A/N$, and the expectation
number of B neighbours is $zN_B/N$. The expectation numbers of AA
and BB bonds are therefore $\tfrac12 zN_A^2/N$ and $\tfrac12 zN_B^2/N$ respectively, and
the expectation number of AB bonds is $zN_AN_B/N$. We have then for
the random arrangement

$$n_{AB}^2 = 4n_{AA}n_{BB}$$

Quasi-chemical approximation

The above result agrees, for zero value of the ordering energy, with
the quasi-chemical equation, exact in one dimension,

$$\frac{n_{AB}^2}{n_{AA}n_{BB}} = 4e^{-2E/kT}$$

with $E = E_{AB} - \tfrac12(E_{AA}+E_{BB})$ where $E_{\alpha\beta}$ is the energy (not the binding energy, but the negative of a binding energy) of an $\alpha\beta$ bond.¹

Rearrangement of the equation in terms of n ($\equiv n_{AA}$) alone gives
the quadratic

$$z^2N_A^2 - 2zNe^{-x}n - 4zN_A(1-e^{-x})n + 4(1-e^{-x})n^2 = 0$$

where $x = 2E/kT$. The balance of power in the middle of the equation
depends essentially on the relative value of $N_A/N$ and $\exp(-x)$.
Very large x leads to $n = zN_A/2$, the condition of complete segregation
of the A molecules in a single cluster, while zero x gives the value of n
corresponding to the completely random distribution.

The solution of the quadratic is conveniently expressed in terms of
a variable $y = e^x - 1$, and we obtain

$$n = \frac{zN_A}{2} + \frac{zN}{4y} \pm \frac{zN}{4y}\sqrt{1 + 4yN_AN_B/N^2}$$

¹ Guggenheim was the first to suggest that the quasi-chemical equation could
be assumed to hold generally. He derived various consequences, and, with
others, later showed that this quasi-chemical approximation is exactly equivalent to that form of Bethe's approximation in which the Bethe sub-system
consists of one site with all of its nearest neighbours. We have already obtained
the quasi-chemical equation by the Bethe method for the one-dimensional
system, and a further discussion of the quasi-chemical method is given in a
later section. (The proof of the general equivalence is to be found in more
elaborate texts.)

Since the maximum geometrically possible value of n is $zN_A/2$, and
the minimum is zero, it is evident that only the negative root gives
an acceptable solution. The effective tendency to aggregate is seen to
depend on the value of $4yN_AN_B/N^2$. Provided this is numerically
small it is a suitable variable in which to expand the square root, and
the solution to second order in y is then

$$n = \frac{zN_A^2}{2N}\left(1 + yN_B^2N^{-2} - 2y^2N_AN_B^3N^{-4}\ldots\right)$$

For small $N_A$, $N_B \approx N$ and we have nearly

$$n \approx \frac{zN_A^2}{2N}\,e^{x}$$

In the true lattice gas situation $2E = -E_{AA}$, which is the binding
energy between two neighbouring molecules.
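The acceptable (negative) root of the quadratic is awkward to evaluate as written when y is small, since it has the form 0/0 at y = 0. Multiplying numerator and denominator by $1 + \sqrt{1 + 4yN_AN_B/N^2}$ gives the equivalent form $n = zN_A/2 - zN_AN_B/[N(1+\sqrt{1 + 4yN_AN_B/N^2})]$, which is used in the sketch below. The function names and the sample numbers are assumptions for illustration.

```python
import math


def n_AA_quasichemical(N, N_A, z, y):
    """Negative root of the quadratic for n = n_AA, rearranged to stay finite as y -> 0."""
    N_B = N - N_A
    s = math.sqrt(1.0 + 4.0 * y * N_A * N_B / N**2)
    return 0.5 * z * N_A - z * N_A * N_B / (N * (1.0 + s))


def bond_numbers(N, N_A, z, y):
    """Return (n_AA, n_AB, n_BB) using the counting relations for a lattice of N sites."""
    n_AA = n_AA_quasichemical(N, N_A, z, y)
    n_AB = z * N_A - 2.0 * n_AA
    n_BB = 0.5 * z * N - z * N_A + n_AA
    return n_AA, n_AB, n_BB


if __name__ == "__main__":
    # Illustrative values: 1000 sites, 300 of species A, twelve neighbours, y = e^x - 1 = 1.5.
    n_AA, n_AB, n_BB = bond_numbers(1000, 300, 12, 1.5)
    print(n_AA, n_AB, n_BB, n_AB**2 / (n_AA * n_BB))
```

The last printed ratio should equal $4e^{-x} = 4/(1+y)$, and setting y = 0 should recover the random value $zN_A^2/2N$.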

Thus in the dilute gas the concentration of 'dimers' differs
from its value for a purely random distribution only by the factor
exp (binding energy/kT), provided $|y| \ll N/4N_A$. This corresponds in
form to the result for the concentration of dimers in a dissociating
gas, which was obtained above on the assumption of saturating forces.

The similarity of these estimates is not surprising, for, if the con- 
centration of monomer is so low that no aggregates larger than 
dimers are likely to be present, the difference between saturating and 
independent interactions has no chance to manifest itself. If we go 
back to the exact solution of the quadratic, however, we have a result, 
limited only by the correctness of the quasi-chemical approximation, 
which may be written 

The internal energy of the lattice gas is then 

U = nE AA 
and the entropy 



•T 



. x w. 



The integration, though laborious, is elementary, provided y > 0, and 
leads to the Helmholtz free energy 

$$F = \tfrac{1}{2}zN_AE_{AA} + NkT[\theta \ln \theta + (1 - \theta) \ln (1 - \theta)] + \ldots$$

where $\theta = N_A/N$, $\beta = [1 + 4\theta(1 - \theta)y]^{1/2}$, $y = \exp(-E_{AA}/kT) - 1$. Note that all
but the first term of this expression is symmetric between $\theta$ and
$(1 - \theta)$.

Problem: Evaluate $(\partial F/\partial N)_{N_A}$ and deduce an effective pressure for
the lattice gas with attractive interaction.

In the true lattice gas, E is half the binding energy of an AA pair
and is generally positive. In a solid solution, however, E may take
either sign, so we must consider also negative values of y. Since y
now goes to -1 as T goes to zero, the integral leading to F no longer
converges, indicating that this form of quasi-chemical approximation
must be breaking down at T=0. One reason is in fact quite evident.
If we ignore surface bonds, the requirement $n_{AB} = 0$, which arises from
T=0, E>0, can always be fulfilled by complete segregation. On the
other hand, the requirement $n_{AA}$ or $n_{BB} = 0$, arising from T=0,
E<0, is geometrically possible only for special structures and
concentrations. For example, in a face-centred cubic lattice with
$N_A = N/4$, all A atoms may be set on one of the four simple cubic
lattices which interlace in this structure, to give $n_{AA} = 0$, but $n_{AA} > 0$
for any $N_A/N > \tfrac{1}{4}$.

Systems of this sort, in which an ordered state exists for suitable 
concentrations and low enough temperatures, are of particular in- 
terest. The configurational entropy associated with the distribution 
of molecules over lattice positions is best estimated by reference to 
the ideally ordered state. This calculation will be discussed in a later 
section (pp. 108-117). 

Molecular oscillations in the lattice gas 

The vibration of a molecule about its equilibrium position in the
lattice will depend on its own nature and on the nature and relative
disposition of its neighbours. For example, an A with two A
neighbours will behave differently according to whether these are polar
opposites or lie next to each other. As a crude estimate, however, we
might neglect this and set each molecule in a spherically symmetric
oscillator potential whose stiffness depended only on the number of
neighbours of each type.

Then the vibration rate $\omega_{Aa}$ for an A molecule with $a$ A neighbours
would be given by

$$\omega_{Aa}^2 = \frac{1}{z}\left[a\,\omega_{Az}^2 + (z - a)\,\omega_{A0}^2\right]$$

In a binary solid solution, we should have correspondingly

$$\omega_{Ba}^2 = \frac{1}{z}\left[a\,\omega_{Bz}^2 + (z - a)\,\omega_{B0}^2\right]$$

where $m_B\omega_{Bz}^2 = m_A\omega_{A0}^2$ if the AB bonds for B surrounded by A and
A surrounded by B have in fact the same stiffness. Moreover, it
would not be surprising if $\omega_{Az}^2/E_{AA}$ were nearly equal to $\omega_{A0}^2/E_{AB}$.

At sufficiently low concentration we need consider only a=0 and
1, having $N_A - 2n_{AA}$ isolated molecules and $2n_{AA}$ molecules with one
A nearest neighbour. At temperatures high enough for the Einstein
approximation to be worth using, the partition function for each
degree of freedom with angular frequency $\omega$ is just $kT/\hbar\omega$.

Even within the approximation of associating a fixed spring con- 
stant with each type of bond, it would be more realistic to distinguish 
the different normal modes of the AA pair, but it is rather doubtfully 
worth while. This sort of analysis is certainly needed when one studies 
lattice mechanics, for example by neutron diffraction, but other un- 
realities of the model are probably more important when comparison 
is made only with thermodynamic data. 

Problem: Find the free energy of a dilute lattice gas of $N_A$ A
molecules on N sites, using the above assumption about their
oscillations and assuming (falsely) that A and AA molecules are
independently distributed over the lattice. Note that the centres of AA
molecules have $\tfrac{1}{2}zN$ possible sites.

Order and disorder 

It is well known that a number of metallic alloys, whose compositions
are close to simple atomic proportions such as AB or AB$_3$, have an
orderly structure at low temperatures, quite analogous to the
structure of ionic crystals with similar atomic ratios. These ordered alloys
are frequently rather brittle and resonant. They pass at higher
temperatures into a disordered state in which the general lattice structure
is unchanged but atoms of different species do not concentrate on
distinct sets of lattice points. The order-disorder transition is generally
accompanied by a large peak in the specific heat-temperature graph,
but a small or apparently zero latent heat at the precise transition
temperature. The typical variation is as shown in Fig. 6, the transition
being called a $\lambda$-point by an obvious similarity.

We have already met a difficulty in applying the quasi-chemical
equation to the Ising lattice gas when the interaction is such that like
atoms or molecules have an energetic preference for remaining apart.
This is, however, the situation that corresponds to order in such
alloys as FeAl, in which two simple cubic lattices of aluminium and
iron atoms interpenetrate in such a way that each cube of iron has
an aluminium atom at its centre and vice versa. In the state of ideal
order at the exact stoichiometric composition, all nearest-neighbour
bonds in this structure are of AB type. It will be interesting to see how
far approximate methods can deal with the ordering at finite
temperature and possibly non-stoichiometric composition.

Fig. 6 Variation of specific heat with temperature round a
typical $\lambda$-point transition.

The geometrical impossibility of fitting the previous quasi- 
chemical equation for some temperatures and compositions suggests 
that the lower 1 symmetry of the ordered state ought to be recognized 
from the beginning in setting up the theory. 

1 The association of lower symmetry with ordering is due to the fact that a 
highly structured situation has fewer symmetry elements than, for example, an 
isotropic situation. Thus a gas or liquid (far from any rigid boundary) has com- 
plete spatial symmetry; a crystal has less, and an ordered crystal less again. 
This is not very satisfactory, for a good theory ought to derive
the breach of symmetry from the molecular interactions. In the pre- 
sent state of knowledge, progress in this direction depends on rather 
advanced theoretical technique. A more empirical approach, accept- 
ing the experimental knowledge, which we already have, as pre- 
scribing the expected order for an alloy of given type, is very much 
easier. 

In a given ordering system, then, we consider the structure of its 
highly ordered condition as determined, say, by diffraction experi- 
ments. The whole lattice is divided into the sub-lattices occupied by 
atoms of different species in the ordered state, and various order 
parameters are defined to describe the partial order which is all that 
we ever actually have. 

These order parameters fall into two groups. One set actually 
vanishes above the transition point, when the populations of the 
different sub-lattices contain identical fractions of the various species : 
they determine the long-range order. The others relate to the devia- 
tion of the numbers of the various bond types from their values for 
a perfectly random atomic distribution, and vanish only at infinite 
temperature: they describe the short-range order. 

In the simplest situation, one long-range and one short-range
parameter suffice. We consider as an example of this an alloy of
composition $A_xB_y$, with the A and B atoms occupying N sites of a
body-centred cubic lattice.

There are then $N_A$ atoms of the type A and $N_B$ of type B, where
$N_A/N_B = x/y$ and $N_A + N_B = N$. FeAl is an alloy of this class. The
lattice consists of two simple cubic sub-lattices, and every atom in
either sub-lattice is at the centre of a cubic cell of the other
sub-lattice: its nearest neighbours are the eight atoms at the corners of
that cube. When x=y, the state of perfect order is that described
above, with all the A atoms on one sub-lattice, all the B atoms on the
other, and all bonds in the structure of AB type.

One sub-lattice is called the $\alpha$ lattice, the other $\beta$, and we decide
arbitrarily to call an A atom rightly situated if it is on the $\alpha$ lattice,
wrongly situated if it is on the $\beta$ lattice. Of the $N_A$ A atoms, $R_A$ lie
on $\alpha$ and $N_A - R_A$ on $\beta$: of the $N_B$ B atoms, $R_B$ lie on $\beta$ and
$N_B - R_B$ on $\alpha$. The number of wrongly occupied $\alpha$ sites is
$W_A = N_B - R_B = \tfrac{1}{2}N - R_A$, and is the number of wrongly placed B atoms.
The number of wrongly occupied $\beta$ sites is $W_B = N_A - R_A = \tfrac{1}{2}N - R_B$.
If a portion of the alloy is in the ordered state, the sensible choice of
the $\alpha$ lattice will be that sub-lattice with most A atoms on it, otherwise
the choice is insignificant. The choice of long-range order parameter is
to some extent also arbitrary. It is reasonable to choose it to vanish
when the sets of A and B atoms are both evenly divided between the
$\alpha$ and $\beta$ lattices. A possible choice is

$$\mathcal{S} = 2(2R_A - N_A)/N = 2(2R_B - N_B)/N$$

which vanishes, as desired, when $R_A = \tfrac{1}{2}N_A$. The factor 2/N gives the
second convenient property that $\mathcal{S} = 1$ for $R_A = N_A = N/2$, the perfect
ordering of the 50-50 alloy. We have then

$$R_A = \tfrac{1}{2}N_A + \tfrac{1}{4}\mathcal{S}N, \qquad R_B = \tfrac{1}{2}N_B + \tfrac{1}{4}\mathcal{S}N$$

$$W_A = \tfrac{1}{2}N - R_A = \tfrac{1}{2}N_B - \tfrac{1}{4}\mathcal{S}N$$

$$W_B = \tfrac{1}{2}N_A - \tfrac{1}{4}\mathcal{S}N$$

Neither $W_A$ nor $W_B$ may be negative, and if $N_A > N_B$, which we may
always choose to make true by which atom we choose to call A,
only $W_A$ may be zero. The greatest value of $\mathcal{S}$ is then $2N_B/N$, which
we shall call y. (Then $2N_A/N = x$, say, and $x + y = 2$.)
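The bookkeeping of right and wrong atoms can be put in a few lines. This is a minimal sketch under the definitions just given; the function name and the small tolerance used for the sign check are our own choices.

```python
def sublattice_populations(N_A, N_B, S):
    """Return (R_A, R_B, W_A, W_B) for long-range order parameter S.

    R_A: A atoms on alpha sites, R_B: B atoms on beta sites,
    W_A: wrongly occupied alpha sites, W_B: wrongly occupied beta sites.
    """
    N = N_A + N_B
    R_A = 0.5 * N_A + 0.25 * S * N
    R_B = 0.5 * N_B + 0.25 * S * N
    W_A = 0.5 * N_B - 0.25 * S * N
    W_B = 0.5 * N_A - 0.25 * S * N
    # None of the four populations may be negative (small tolerance for rounding).
    if min(R_A, R_B, W_A, W_B) < -1e-9 * N:
        raise ValueError("S lies outside the range allowed by the composition")
    return R_A, R_B, W_A, W_B


if __name__ == "__main__":
    print(sublattice_populations(600, 400, 0.5))
```

Each sub-lattice holds N/2 atoms, so $R_A + W_A$ and $R_B + W_B$ should both equal N/2 for any admissible $\mathcal{S}$.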

Let the number of nearest neighbours of one atom (actually 8 in
b.c.c.) be z. Then the number of bonds in the structure is $\tfrac{1}{2}Nz$ and
each bond joins one $\alpha$ to one $\beta$ site. Since 'right' and 'wrong'
atoms are distinguished in the ordered state, we distinguish between
AB bonds, joining right atoms with A on $\alpha$ and B on $\beta$, and BA
bonds, joining wrong atoms with A on $\beta$ and B on $\alpha$, although AB
and BA bonds have identical energy. Let the numbers of bonds of
the different types be $Q_{AA}$, $Q_{AB}$, $Q_{BA}$, $Q_{BB}$. These numbers are
connected, as may be seen by considering in turn the bonds which end on
right and wrong, A and B atoms.

Thus

$$Q_{AA} + Q_{AB} = zR_A$$

$$Q_{AA} + Q_{BA} = zW_B$$

$$Q_{BB} + Q_{BA} = zW_A$$

$$Q_{BB} + Q_{AB} = zR_B$$

or

$$Q_{AA} = zR_A - Q_{AB}$$

$$Q_{BB} = zR_B - Q_{AB}$$

$$Q_{BA} = zW_B - Q_{AA} = Q_{AB} - \tfrac{1}{2}z\mathcal{S}N$$

so once $N_A$, $N_B$ and $\mathcal{S}$ are known, $Q_{AB}$ suffices to define all the Q
and so the bond energy. We shall use $Q_{AB}$ itself as a short-range
order parameter, though something more elegant can easily be
manufactured.
The total bond energy is simply

$$E(Q) = E_{AA}Q_{AA} + E_{BB}Q_{BB} + E_{AB}(Q_{AB} + Q_{BA})$$

$$= \tfrac{1}{2}zN_AE_{AA} + \tfrac{1}{2}zN_BE_{BB} + (2Q_{AB} - \tfrac{1}{2}z\mathcal{S}N)\Delta$$

where $\Delta$ is the Ising energy $E_{AB} - \tfrac{1}{2}(E_{AA} + E_{BB})$. This use of the letter
$\Delta$ is consistent with the previous chapter, and should cause no
confusion.
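That the two forms of the bond energy agree is a matter of simple algebra, which may be confirmed numerically. The sketch below is illustrative only: the function names and the sample numbers are ours, and the relations used are those connecting the Q to $Q_{AB}$ and $\mathcal{S}$.

```python
def bond_numbers(z, N_A, N_B, S, Q_AB):
    """Bond counts (Q_AA, Q_AB, Q_BA, Q_BB) fixed by S and Q_AB."""
    N = N_A + N_B
    R_A = 0.5 * N_A + 0.25 * S * N
    R_B = 0.5 * N_B + 0.25 * S * N
    Q_AA = z * R_A - Q_AB
    Q_BB = z * R_B - Q_AB
    Q_BA = Q_AB - 0.5 * z * S * N
    return Q_AA, Q_AB, Q_BA, Q_BB


def bond_energy_direct(Q, E_AA, E_BB, E_AB):
    """Sum of bond energies, AB and BA bonds having the same energy."""
    Q_AA, Q_AB, Q_BA, Q_BB = Q
    return E_AA * Q_AA + E_BB * Q_BB + E_AB * (Q_AB + Q_BA)


def bond_energy_closed(z, N_A, N_B, S, Q_AB, E_AA, E_BB, E_AB):
    """Closed form in terms of the Ising energy Delta."""
    N = N_A + N_B
    delta = E_AB - 0.5 * (E_AA + E_BB)
    return (0.5 * z * N_A * E_AA + 0.5 * z * N_B * E_BB
            + (2.0 * Q_AB - 0.5 * z * S * N) * delta)


if __name__ == "__main__":
    Q = bond_numbers(8, 600, 400, 0.5, 2000.0)
    print(Q, bond_energy_direct(Q, -1.0, -2.0, -3.0))
```

The four bond numbers must also add up to the total number of bonds, $\tfrac{1}{2}Nz$.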

We come now to the much harder problem of estimating the
number of configurations of given energy, that is, for given values of
$\mathcal{S}$ and $Q_{AB}$. If $\mathcal{S}$ alone is fixed, the count is easy. We have to
distribute $R_A$ A atoms over the N/2 $\alpha$ sites, and independently to
distribute $R_B$ B atoms over the $\beta$ sites. The number of distinct
configurations is then

$$W(\mathcal{S}) = \frac{[(N/2)!]^2}{R_A!\,W_A!\,R_B!\,W_B!}$$

These configurations, however, range over all the possible values of
$Q_{AB}$, so do not correspond to a fixed energy. 1

The energy is of course given by the single parameter $Q_{AB} + Q_{BA}$,
but we have already been led into difficulty by relying on that alone.
If $Q_{AB}$ and $Q_{BA}$ are separately specified, $\mathcal{S}$ is determined, and we may
obtain a more reliable count of configurations.

Once the Q have been fixed, the arrangement of the various bond
types is still constrained by the requirement that one atomic site can
be occupied by either an A or a B atom, but not by both. Consider
the imagined structure generated by a random distribution of
specified bonds over the $\tfrac{1}{2}Nz$ bond positions in a body-centred cubic
lattice. In general one would find the atoms at many sites specified as
A by some of their bonds, and as B by others. The number of
realizable distributions will be a fraction of the number of random
distributions of bonds, and this fraction must depend on both $\mathcal{S}$ and
$Q_{AB}$. To identify it would be the so-far unachieved complete solution
of the problem.

1 A useful and important dodge is to cut off the calculation at this stage by
assigning the same suitable mean value to all states of given $\mathcal{S}$. The simplest
procedure is to assign to each atom on $\alpha$, $2zR_B/N$ B neighbours and $2zW_B/N$ A
neighbours, and conversely. The corresponding energy we denote by $E_{BW}(\mathcal{S})$
since the method was suggested by Bragg and Williams (Proc. Roy. Soc.
A145, p. 699, 1934) and then the free energy

$$F(\mathcal{S}, T) = E_{BW}(\mathcal{S}) - kT \ln W(\mathcal{S})$$

may be minimized with respect to $\mathcal{S}$ to find the thermodynamic equilibrium
state.
The shortcut which generates a new variety of quasi-chemical
equation is to assume that the correction factor depends on $\mathcal{S}$ alone.
It is plausible, from the dependence of the number of random
distributions on the Q, that the correction should be more sensitive to
$Q_{AB} - Q_{BA}$ than to $Q_{AB} + Q_{BA}$, but hard to go further. At least, we can
now make sure that not only are the states of given $\mathcal{S}$ correctly
counted, but their energy levels are correctly listed, while we hope
that the number of states on each level is not too badly distorted.
This should represent an advance on both previous approximations.
We write, then, for the number of configurations of given $\mathcal{S}$ and
$Q_{AB}$

$$W(\mathcal{S}, Q) = R(\mathcal{S})\,\frac{(\tfrac{1}{2}zN)!}{Q_{AA}!\,Q_{AB}!\,Q_{BA}!\,Q_{BB}!}$$

and propose to determine $R(\mathcal{S})$ from the requirement that

$$\sum_{Q_{AB}} W(\mathcal{S}, Q) = W(\mathcal{S})$$

This need only be carried through with enough precision that ln W
is known down to first order in N, since terms in $N^{2/3}$, ln N, and so
on are not perceived in the bulk thermodynamic functions. The
quadrinomial coefficient in $W(\mathcal{S}, Q)$ rises so spectacularly to its
maximum permitted value that the number of terms near that
maximum which make up nearly the whole of the sum is negligible
compared with N. Thus we need locate only the minimum of
$\ln (Q_{AA}!\,Q_{AB}!\,Q_{BA}!\,Q_{BB}!)$ for the given $\mathcal{S}$ to evaluate $\ln R(\mathcal{S})$.
Stirling's expansion gives then

$$\ln W(\mathcal{S}) = 2\left[\frac{N}{2}\ln\frac{N}{2} - \frac{N}{2}\right] - R_A \ln R_A + R_A - W_A \ln W_A + W_A - R_B \ln R_B + R_B - W_B \ln W_B + W_B$$

$$= N \ln \frac{N}{2} - \sum R \ln R$$

where $\sum R \ln R$ is a contraction for

$$R_A \ln R_A + W_A \ln W_A + R_B \ln R_B + W_B \ln W_B$$
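The claim that only terms of first order in N matter can be tested by comparing the Stirling form with the exact logarithm of the count. The following sketch is ours, not the author's: it uses the standard-library log-gamma function for the factorials, and the sample populations are arbitrary.

```python
import math


def log_factorial(n):
    """ln(n!) via the log-gamma function."""
    return math.lgamma(n + 1.0)


def ln_W_exact(R_A, W_A, R_B, W_B):
    """Exact ln W(S): two independent binomial counts, one per sub-lattice."""
    return (log_factorial(R_A + W_A) + log_factorial(R_B + W_B)
            - log_factorial(R_A) - log_factorial(W_A)
            - log_factorial(R_B) - log_factorial(W_B))


def ln_W_stirling(R_A, W_A, R_B, W_B):
    """Leading Stirling form N ln(N/2) - sum R ln R (0 ln 0 taken as 0)."""
    N = R_A + W_A + R_B + W_B
    total = N * math.log(N / 2.0)
    for r in (R_A, W_A, R_B, W_B):
        if r > 0:
            total -= r * math.log(r)
    return total


if __name__ == "__main__":
    pops = (7000, 3000, 7000, 3000)
    print(ln_W_exact(*pops), ln_W_stirling(*pops))
```

For these populations the two values differ by only a few units in a total of order $10^4$, which is the ln N correction the text proposes to neglect.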
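Similarly

$$\ln W(\mathcal{S}, Q) = \ln R(\mathcal{S}) + \tfrac{1}{2}Nz \ln(\tfrac{1}{2}Nz) - \tfrac{1}{2}Nz - \sum Q \ln Q + \sum Q$$

$$= \ln R(\mathcal{S}) + \tfrac{1}{2}Nz \ln(\tfrac{1}{2}Nz) - Q_{AB} \ln Q_{AB} - (Q_{AB} - \tfrac{1}{2}z\mathcal{S}N) \ln (Q_{AB} - \tfrac{1}{2}z\mathcal{S}N) - (zR_A - Q_{AB}) \ln (zR_A - Q_{AB}) - (zR_B - Q_{AB}) \ln (zR_B - Q_{AB})$$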
Setting $q = Q_{AB}$, then, we have

$$\frac{\partial}{\partial q} \ln W(\mathcal{S}, q) = -\ln q - \ln (q - \tfrac{1}{2}z\mathcal{S}N) + \ln (zR_A - q) + \ln (zR_B - q)$$

which must be zero for the turning value $q_0$, so

$$q_0(q_0 - \tfrac{1}{2}z\mathcal{S}N) = (zR_A - q_0)(zR_B - q_0)$$

i.e.

$$q_0 = 2zR_AR_B/N$$

We approximate $R(\mathcal{S})$ (or redefine it, at choice) by the relation

$$\ln W(\mathcal{S}) = N \ln \frac{N}{2} - \sum R \ln R = \ln R(\mathcal{S}) + \tfrac{1}{2}zN \ln (\tfrac{1}{2}zN) - \sum Q_0 \ln Q_0$$

the $Q_0$ corresponding to the turning value. Elimination of $R(\mathcal{S})$ gives
then

$$\ln W(\mathcal{S}, Q) = N \ln \frac{N}{2} - \sum R \ln R + \sum Q_0 \ln Q_0 - \sum Q \ln Q$$

and the 'constrained free energy' for given $N_A$, $N_B$, $\mathcal{S}$, $Q_{AB}$ is

$$F = E(Q) - kT \ln W(\mathcal{S}, Q)$$

The true free energy is to be estimated by averaging over all
configurations, but we use essentially the same shortcut twice more,
first to find a less constrained free energy for given $\mathcal{S}$, then to find
what $\mathcal{S}$ gives the thermodynamic equilibrium state. The assumption
is that the average value is practically coincident with the value at the
minimum of F.

To find the free energy for $\mathcal{S}$, alone, prescribed, we take the
relevant part of F/kT, which is $\phi$, equal to $(2\Delta Q_{AB}/kT) + \sum Q \ln Q$, and
differentiate with respect to $q = Q_{AB}$; so for the turning value

$$0 = \frac{\partial \phi}{\partial q} = (2\Delta/kT) + \ln q + \ln (q - \tfrac{1}{2}z\mathcal{S}N) - \ln (zR_A - q) - \ln (zR_B - q)$$

or

$$e^{-2\Delta/kT}(zR_A - q)(zR_B - q) = q(q - \tfrac{1}{2}z\mathcal{S}N)$$

i.e.

$$e^{-2\Delta/kT}\,Q_{AA}Q_{BB} = Q_{AB}Q_{BA}$$

Now $Q_{AB} = Q_{BA}$ if $\mathcal{S} = 0$, and the last equation is then identical
with the quasi-chemical equation used above as the starting point
for the discussion of the lattice gas.

Thus if, as we shall see, there is a temperature region in which
$\mathcal{S} = 0$ for the ordering system, the expressions derived above for the
entropy and free energy must be valid in that region. For non-zero
values of $\mathcal{S}$, $Q_{AB} \neq Q_{BA}$.

The above quadratic in q is simplified by introduction of the symbol
f for $[1 - \exp (2\Delta/kT)]^{-1}$. Since $\Delta$ is negative in an ordering system,
f ranges from 1 at T=0 to infinity at T infinite, being $-kT/2\Delta$ for
$kT \gg |\Delta|$.

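The solutions of the quadratic are given by

$$p = \frac{4q}{zN} = f + \mathcal{S} \pm [f^2 - f(\mathcal{S}^2 + xy) + \mathcal{S}^2]^{1/2}$$

but the lower bound on f, together with the upper bound on q, forbids
the positive root. For large values of f

$$p \approx \mathcal{S} + \tfrac{1}{2}(\mathcal{S}^2 + xy)$$

which reduces to the value $\tfrac{1}{2}xy$ for a purely random distribution
at $\mathcal{S} = 0$.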
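The acceptable root is easily evaluated. The sketch below is illustrative: the function names are ours, p is taken to be the reduced bond number $4Q_{AB}/zN$, and the negative root is computed from the product of the two roots, $f(x + \mathcal{S})(y + \mathcal{S})$, to avoid cancellation at large f. The quasi-chemical equation in reduced form, $e^{-2\Delta/kT}(x + \mathcal{S} - p)(y + \mathcal{S} - p) = p(p - 2\mathcal{S})$, serves as a check.

```python
import math


def f_from_temperature(delta_over_kT):
    """f = [1 - exp(2*Delta/kT)]^(-1); Delta/kT must be negative (ordering system)."""
    if delta_over_kT >= 0.0:
        raise ValueError("Delta/kT must be negative for an ordering system")
    return 1.0 / (1.0 - math.exp(2.0 * delta_over_kT))


def reduced_AB_bonds(f, S, x, y):
    """Negative root p of p^2 - 2(f + S)p + f(x + S)(y + S) = 0."""
    disc = f * f - f * (S * S + x * y) + S * S
    larger_root = f + S + math.sqrt(disc)
    return f * (x + S) * (y + S) / larger_root  # product of roots / larger root


if __name__ == "__main__":
    f = f_from_temperature(-0.3)
    print(reduced_AB_bonds(f, 0.4, 1.2, 0.8))
```

At very large f the result should approach $\mathcal{S} + \tfrac{1}{2}(\mathcal{S}^2 + xy)$, the value for a random distribution of bonds.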

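Inserting now the values of Q corresponding to the q just
determined, we are to minimize

$$\chi = \frac{\Delta}{kT}(2q - \tfrac{1}{2}z\mathcal{S}N) + \sum R \ln R - \sum q_0 \ln q_0 + \sum q \ln q$$

by varying $\mathcal{S}$. Because of the preceding operations, terms which
arise from the implicit dependence of q and $q_0$ on $\mathcal{S}$ all cancel, and
there remain in the derivative only

$$-\frac{zN\Delta}{2kT} + \frac{N}{4} \ln \frac{R_AR_B}{W_AW_B} - \frac{\partial}{\partial \mathcal{S}}\left(\sum q_0 \ln q_0\right) + \frac{\partial}{\partial \mathcal{S}}\left(\sum q \ln q\right)$$

where the derivatives symbolized by $\partial/\partial \mathcal{S}$ are taken (after expressing
each Q in terms of the corresponding $Q_{AB}$) as if q and $q_0$ were
constants.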
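Thus

$$\frac{\partial}{\partial \mathcal{S}}\left(\sum q \ln q\right) = [\ln (q - \tfrac{1}{2}z\mathcal{S}N) + 1]\left(-\tfrac{1}{2}zN\right) + [\ln (zR_A - q) + 1]\frac{zN}{4} + [\ln (zR_B - q) + 1]\frac{zN}{4}$$

remembering that $R_A$, $R_B$ both depend on $\mathcal{S}$. Then

$$\frac{d\chi}{d\mathcal{S}} = -\frac{zN\Delta}{2kT} + \frac{N}{4} \ln \frac{R_AR_B}{W_AW_B} + \frac{zN}{4} \ln \frac{(zR_A - q)(zR_B - q)}{(zR_A - q_0)(zR_B - q_0)} - \frac{zN}{2} \ln \frac{q - \tfrac{1}{2}z\mathcal{S}N}{q_0 - \tfrac{1}{2}z\mathcal{S}N}$$

or, referring back to the equations defining q and $q_0$,

$$\frac{d\chi}{d\mathcal{S}} = \frac{N}{4} \ln \frac{R_AR_B}{W_AW_B} + \frac{zN}{4} \ln \frac{q(q_0 - \tfrac{1}{2}z\mathcal{S}N)}{q_0(q - \tfrac{1}{2}z\mathcal{S}N)}$$

and the turning point condition can be expressed in several ways, of
which one is

$$\frac{2\Delta}{kT} = \left(1 - \frac{1}{z}\right) \ln \frac{(x - \mathcal{S})(y - \mathcal{S})}{(x + \mathcal{S})(y + \mathcal{S})} + \ln \frac{(x + \mathcal{S} - p)(y + \mathcal{S} - p)}{(2\mathcal{S} - p)^2}$$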

It is not hard to verify that $\mathcal{S} = 0$ always satisfies this equation.
This is depressing at first glance, so we examine the nature of the
turning point. The value of $d^2\chi/d\mathcal{S}^2$ at $\mathcal{S} = 0$ is

$$\frac{N^2}{4}\frac{N}{N_AN_B} + \frac{(zN)^2}{8q} - \frac{(zN)^2}{8q_0}$$

which changes sign at

$$\frac{zN}{4q} = \frac{zN}{4q_0} - \frac{1}{2z}\frac{N^2}{N_AN_B}$$

i.e.

$$\frac{xy}{p} = 2\left(1 - \frac{1}{z}\right)$$

At high temperatures q is very nearly $q_0$ and $\mathcal{S} = 0$ gives a
minimum of free energy. As the temperature is reduced and q increases,
at a critical temperature $\mathcal{S} = 0$ gives a neutral value, and below this
temperature $\mathcal{S} = 0$ gives a maximum of free energy. At this lower
temperature an equilibrium state must appear at a non-zero $\mathcal{S}$
determined by the equation given above. This yields to numerical
rather than literal analysis, and the variation of F with $\mathcal{S}$ at various
temperatures is shown in Fig. 7.

Fig. 7 Variation of free energy with long-range order at different
temperatures. (The three curves are drawn on different energy
scales.) Horizontal axis: order parameter $\mathcal{S}$.

The equilibrium value of $\mathcal{S}$ goes to zero continuously as T tends to $T_c$.
There is then no discontinuity in the entropy at $T_c$, so no latent heat,
but the specific heat does increase without limit.
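The numerical analysis referred to can be sketched very simply. What follows is a rough illustration rather than the calculation behind Fig. 7: the function names are ours, the free energy is taken per site with composition-independent constants dropped, all quantities are expressed through the reduced variables $x$, $y$, $\mathcal{S}$ and $p = 4Q_{AB}/zN$, and the minimum is located by a plain grid search.

```python
import math


def xlogx(u):
    """u ln u, with the limit 0 taken for u <= 0 (guards against rounding)."""
    return u * math.log(u) if u > 0.0 else 0.0


def chi_per_site(S, delta_over_kT, x=1.0, y=1.0, z=8):
    """chi/N up to an S-independent constant, with q at its turning value."""
    f = 1.0 / (1.0 - math.exp(2.0 * delta_over_kT))
    disc = max(f * f - f * (S * S + x * y) + S * S, 0.0)
    p = f * (x + S) * (y + S) / (f + S + math.sqrt(disc))  # quasi-chemical value
    p0 = 0.5 * (x + S) * (y + S)                            # random-bond value
    sites = sum(xlogx(0.25 * r) for r in (x + S, y + S, x - S, y - S))
    bonds = sum(xlogx(u) for u in (p, p - 2.0 * S, x + S - p, y + S - p))
    bonds0 = sum(xlogx(u) for u in (p0, p0 - 2.0 * S, x + S - p0, y + S - p0))
    return delta_over_kT * 0.5 * z * (p - S) + sites + 0.25 * z * (bonds - bonds0)


def equilibrium_order(delta_over_kT, x=1.0, y=1.0, z=8, steps=2000):
    """Grid estimate of the S in [0, min(x, y)) that minimizes chi."""
    if delta_over_kT >= 0.0:
        raise ValueError("Delta/kT must be negative for an ordering system")
    S_max = min(x, y)
    grid = [S_max * i / steps for i in range(steps)]
    return min(grid, key=lambda S: chi_per_site(S, delta_over_kT, x, y, z))


if __name__ == "__main__":
    for kT in (4.0, 3.5, 3.0, 2.0):      # in units of |Delta|, equiatomic b.c.c.
        print(kT, equilibrium_order(-1.0 / kT))
```

With $\Delta = -1$ and z = 8 the equiatomic alloy should show $\mathcal{S} = 0$ at the higher temperatures and a non-zero minimum at the lower ones.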
The critical temperature itself is now located by the equation

$$p = f - \sqrt{f^2 - fxy} = \frac{xyz}{2(z - 1)}$$

For the equiatomic alloy x=y=1 and

$$f - \sqrt{f^2 - f} = 4/7$$

for z=8. Then

$$f = 16/7 = (1 - e^{2\Delta/kT_c})^{-1}$$

or

$$e^{2\Delta/kT_c} = 9/16$$

i.e.

$$kT_c = \frac{-2\Delta}{\ln (16/9)}$$

In the general case

$$f_c = \frac{xyz^2}{4(z - 1)} \quad \text{and} \quad \frac{2\Delta}{kT_c} = \ln \left(1 - \frac{4(z - 1)}{xyz^2}\right)$$

giving a specific dependence of the ordering temperature on the
energy $\Delta$ and the composition.
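The general result is readily tabulated. The sketch below simply evaluates the two formulae just given; the function names are ours, and an error is raised when $f_c \le 1$, for which the formula gives no finite ordering temperature.

```python
import math


def critical_f(x, y, z):
    """f_c = x y z^2 / [4 (z - 1)]."""
    return x * y * z * z / (4.0 * (z - 1.0))


def critical_kT(delta, x, y, z):
    """kT_c from 2*Delta/kT_c = ln(1 - 1/f_c); delta is negative for ordering."""
    arg = 1.0 - 1.0 / critical_f(x, y, z)
    if arg <= 0.0:
        raise ValueError("no finite ordering temperature at this composition")
    return 2.0 * delta / math.log(arg)


if __name__ == "__main__":
    # Equiatomic b.c.c. alloy with Delta = -1 (arbitrary energy units).
    print(critical_f(1.0, 1.0, 8), critical_kT(-1.0, 1.0, 1.0, 8))
```

For x=y=1 and z=8 this reproduces $f_c = 16/7$ and $kT_c = -2\Delta/\ln(16/9)$.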



Comparison with experiment 

The experimental determination of the ordering temperature, by 
diffraction measurements over a range of T which includes T c , is in 
principle straightforward. A considerable practical difficulty is to 
ensure that the material has reached its equilibrium structure at each 
temperature of measurement, since the diffusive interchange of 
atoms is in many alloys very slow near the ordering temperature. 

Another difficulty arises because at given (say, atmospheric) pres- 
sure the atomic spacing in the alloy varies with composition. The 
corresponding variation of the ordering energy A is not easy to 
evaluate. One way to estimate it would be to find the bulk modulus 
of elasticity of pure A, pure B, and the fully ordered AB alloy and 
interpret this as a variation of bond energy with bond length. This, 
however, is certainly unreliable if either pure substance is unobtain- 
able with the same crystal structure as the alloy. For example, 
aluminium cannot be had in the body-centred cubic structure of the 
alloy FeAl. 

A better test, then, is to conduct the experiment at various
pressures so that the ordering temperature is determined as a function of
composition at constant lattice spacing. This is well discussed by
Yoon and Bienenstock, 1 who find that for $\beta$-brass, at least, the
ordering energy is not independent of concentration even at constant
atomic spacing. Their conclusion is not dependent on the
approximations made in our statistical argument, because correct statistics
must still lead to an equation of the form

where f is determined solely by counting configurations. Thus for a
set of specimens of different composition, $\partial(\ln T_c)/\partial(\ln a)$, where a
is the interatomic spacing, is simply $d \ln \Delta/d \ln a$. Yoon and Bienenstock
find that $\partial \ln T_c/\partial \ln a$ changes by 9% for a 3% change in zinc
content, and observe that interactions of atoms with second-nearest as
well as with nearest neighbours might readily account for this.

1 D.-Y. Yoon and A. Bienenstock, Physical Review 170, p. 631, 1968.

The introduction of simple second-neighbour interactions is not 
unduly hard in principle. One requires at least one new order para- 
meter, and the statistics are most lucidly handled by explicit use of 
Bethe's method. The task of distinguishing whether the defeat of 
theoretical predictions by experimental facts is due to an unsuitable 
model or an inadequate calculation then becomes much harder, and 
the drawing of sound conclusions from experimental results cor- 
respondingly more difficult. 

This basic difficulty, that the ordering phenomenon is familiar 
only in systems where the forces of interaction are not known in 
sufficient detail for a good test of the theory, recurs when we consider 
still more complex transitions. 



8 

DENSE DISORDERED SYSTEMS 



Some systems without long-range order 

The methods of the last chapter lead to lengthy calculations in the
treatment of states of non-zero $\mathcal{S}$, below the ordering temperature.
If $\mathcal{S} = 0$, on the other hand, $Q_{AB} = Q_{BA}$ and the quasi-chemical
equation in its original form is valid. From the last chapter we have
now, for N ($= N_A + N_B$) sites carrying $N_A$ A and $N_B$ B atoms

$$Q_{AB} = Q_{BA} = \frac{zN}{4}\,\frac{xy}{1 + \beta}$$

$$Q_{AA} = \frac{zN}{4}\,\frac{x}{1 + \beta}\,(1 + \beta - y)$$

and similarly

$$Q_{BB} = \frac{zN}{4}\,\frac{y}{1 + \beta}\,(1 + \beta - x)$$

with

$$x = 2N_A/N, \quad y = 2N_B/N, \quad \beta^2 = 1 + xy(e^{2\Delta/kT} - 1)$$
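These bond numbers can be checked against the quasi-chemical equation $e^{-2\Delta/kT}Q_{AA}Q_{BB} = Q_{AB}Q_{BA}$ and against the total number of bonds. The sketch is ours and purely illustrative; the function name and the sample composition are assumptions.

```python
import math


def disordered_bond_numbers(z, N_A, N_B, delta_over_kT):
    """(Q_AA, Q_AB, Q_BA, Q_BB) for zero long-range order."""
    N = N_A + N_B
    x = 2.0 * N_A / N
    y = 2.0 * N_B / N
    beta = math.sqrt(1.0 + x * y * (math.exp(2.0 * delta_over_kT) - 1.0))
    c = 0.25 * z * N
    Q_AB = c * x * y / (1.0 + beta)
    Q_AA = c * x * (1.0 + beta - y) / (1.0 + beta)
    Q_BB = c * y * (1.0 + beta - x) / (1.0 + beta)
    return Q_AA, Q_AB, Q_AB, Q_BB  # Q_BA equals Q_AB when S = 0


if __name__ == "__main__":
    print(disordered_bond_numbers(8, 600, 400, -0.3))
```

Setting $\Delta = 0$ gives $\beta = 1$ and the bond numbers of a purely random distribution.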

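The configuration entropy $S_c$ can now be reduced, with a little
manipulation, to the form

$$\frac{S_c}{k} = -\left[N_A \ln \left(\frac{N_A}{N}\right) + N_B \ln \left(\frac{N_B}{N}\right)\right] - \frac{zN}{4(1 + \beta)}\left[x(1 + \beta - y) \ln \frac{2(1 + \beta - y)}{x(1 + \beta)} + y(1 + \beta - x) \ln \frac{2(1 + \beta - x)}{y(1 + \beta)} + 2xy \ln \frac{2}{1 + \beta}\right]$$

which shows the entropy of completely random mixing supplemented
by a group of terms each of which clearly vanishes as $\beta \to 1$, that is,
as $T \to \infty$, or $x \to 0$.

The bond energy is at the same time

$$U_b = \frac{zN}{4}\left(xE_{AA} + yE_{BB} + \frac{2xy}{1 + \beta}\Delta\right)$$

so the 'configuration part' of the free energy is

$$F_c = U_b - TS_c$$

which may be written as

$$\frac{zN}{4}(xE_{AA} + yE_{BB} + xy\Delta) + kT\left[N_A \ln \left(\frac{N_A}{N}\right) + N_B \ln \left(\frac{N_B}{N}\right)\right] + \text{correction terms}$$

$$= F_{BW} + \text{correction terms}$$

where the Bragg-Williams free energy $F_{BW}$ is obtained by setting
$\beta = 1$ in $F_c$.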

Problem: Show that if E BB =0 this expression is identical with that 
derived previously for the lattice gas. (The terms containing the 
factor xy cancel.) 

Free energy of vibration 

The above expression takes no account of molecular motion, whose
contribution to the partition function may be estimated using the
Einstein model discussed on p. 105 above. This gives relations of the
form

$$\omega^2(\text{A on } \alpha) = \frac{1}{z}\left[\bar{a}\,\omega_{Az}^2 + (z - \bar{a})\,\omega_{A0}^2\right]$$

if we add the simplifying assumption that each atom in a site specified
as, for example, A on $\alpha$, vibrates as if it had the average number $\bar{a}$
of A neighbours, and $z - \bar{a}$ of B neighbours, for all such sites.

Since the values of $\omega^2$ given in this way depend on both order
parameters, the only consistent procedure is to go right back
through the calculation of the last chapter, emerging with a
quasi-chemical equation and its consequences.

Problem (for the courageous): Try to construct a suitably modified
quasi-chemical equation.

However, the simplifying assumption used to obtain the above
value of $\omega^2$(A on $\alpha$) is the dynamical parallel of assuming that the
bond energy is to be calculated as if each atom were surrounded by
the expectation set of nearest neighbours, which is exactly the
assumption of the Bragg-Williams method.

Even adding the free energy of motion, estimated from our simple
assumptions, to the Bragg-Williams form of the configurational free
energy is less straightforward than one might wish, except in the
limit of low concentration, when an artificially small frequency shift
enables us to use

$$\ln \frac{kT}{\hbar(\omega + \delta)} = -\frac{\delta}{\omega} + \ln \frac{kT}{\hbar\omega}$$

Problem: (a) Find the free energy of a solid solution with $\mathcal{S} = 0$,
for this simplified case. (b) Find the part of $\Delta F_c$, given above, which
is proportional to $x^2$ in the limit of small x, and compare it with the
vibration term.

Segregation in completely disordered systems 

We consider first of all a lattice gas in which the effects of thermal
motion are assumed negligible compared with configuration effects,
so that the concentration-dependent part of the free energy is
approximately

$$F_{BW} = \theta NE_A + \frac{zN}{2}[\theta E_{AA} + 2\theta(1 - \theta)\Delta] + NkT[\theta \ln \theta + (1 - \theta) \ln (1 - \theta)]$$

where $\theta = x/2 = N_A/N$.
The chemical potential of this gas is

$$\mu = \partial F/\partial N_A = N^{-1}\,\partial F/\partial \theta$$

the derivative with respect to $N_A$ being taken for a fixed value of N,
so

$$\mu(\theta) = \frac{z}{2}[E_{AA} + 2\Delta - 4\theta\Delta] + E_A + kT[\ln \theta - \ln (1 - \theta)]$$

$$= E_A + \tfrac{1}{2}zE_{AA} + z\Delta(1 - 2\theta) + kT \ln \frac{\theta}{1 - \theta}$$

As $\theta$ increases from 0 to 1, the logarithm increases monotonically
from $-\infty$ to $+\infty$, with a least slope of 4 at $\theta = \tfrac{1}{2}$. If $\Delta$ is positive,
then for temperatures below $T_s = z\Delta/2k$ the graph of $\mu$ against $\theta$
will reverse its slope for some range of $\theta$. In fact, $\partial\mu/\partial\theta$ vanishes at
$\theta = \tfrac{1}{2} \pm \tfrac{1}{2}(1 - T/T_s)^{1/2}$. The corresponding range of $\mu$ extends a distance
$z\Delta\sqrt{1 - T/T_s} - kT \ln [(1 + \sqrt{1 - T/T_s})/(1 - \sqrt{1 - T/T_s})]$ above and
below the value $\tfrac{1}{2}zE_{AA} + E_A$.
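The reversal of slope can be exhibited numerically. In this sketch, which is ours and not the author's, energies are measured in units of $kT_s$ (so that $z\Delta = 2$), the constant $E_A + \tfrac{1}{2}zE_{AA}$ is subtracted, and $t$ stands for $T/T_s$; the function names are assumptions.

```python
import math


def mu_excess(theta, t):
    """[mu - E_A - z*E_AA/2] / (k*T_s) for reduced temperature t = T/T_s."""
    return 2.0 * (1.0 - 2.0 * theta) + t * math.log(theta / (1.0 - theta))


def mu_slope(theta, t):
    """d(mu)/d(theta) in the same units."""
    return -4.0 + t / (theta * (1.0 - theta))


def slope_reversal_points(t):
    """The two concentrations at which d(mu)/d(theta) vanishes, or None if t >= 1."""
    if t >= 1.0:
        return None
    s = math.sqrt(1.0 - t)
    return 0.5 * (1.0 - s), 0.5 * (1.0 + s)


if __name__ == "__main__":
    t = 0.75
    lower, upper = slope_reversal_points(t)
    print(lower, upper, mu_excess(lower, t), mu_excess(upper, t))
```

At $t = 0.75$ the slope vanishes at $\theta = 0.25$ and $0.75$, and $\mu$ takes values equally far above and below its central value at these two points.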

To an arbitrarily chosen value of $\mu$ within this range, then,
correspond three distinct values of $\theta$. The outer pair of these, for both of
which $\partial\mu/\partial\theta$ is positive, correspond to thermodynamically stable
conditions.

To see this clearly, let us consider the exchange of molecules
between the lattice gas (for example, on an adsorbing surface) and a
reservoir (for example, of vapour) which fixes the chemical potential.
At a surface concentration $\theta$, at which $\partial\mu/\partial\theta$ is positive, the
adsorption of more molecules will raise the value of $\mu$. Now matter tends to
move from higher to lower $\mu$, so this fluctuation from equilibrium
will be restored by re-evaporation of the excess molecules.

At the intermediate solution of $\mu(\theta) = \mu$(reservoir), on the other
hand, $\partial\mu/\partial\theta$ being negative, further adsorption leads to still further
adsorption and will run up irreversibly to the upper of the stable
solutions. Similarly, a downward fluctuation will run right away to
the lower stable solution.

Thus at sufficiently low temperatures, for a given value of $\mu$ in
the appropriate range, the disordered lattice gas may exist in two
forms, one of relatively high and the other of low concentration.

These two forms are evidently capable of existing in equilibrium
with each other. We have thus an analogy with the phenomenon of
condensation of vapour to liquid, the limiting temperature $T_s$
appearing as the analogue of the critical temperature for
condensation.

A completely disordered solid solution can be treated in almost
exactly the same way, with the precaution that $\mu$ is to be calculated
recognizing that if $N_A$ or $N_B$ is increased N likewise increases: for
example $\mu_A = \partial F/\partial N_A$ for fixed $N_B$, not for fixed N.

Problem: Take

$$F = \frac{zN}{2}[\theta E_{AA} + (1 - \theta)E_{BB} + 2\theta(1 - \theta)\Delta] + NkT[\theta \ln \theta + (1 - \theta) \ln (1 - \theta)]$$

calculate $\mu_A$ and $\mu_B$, and find the condition that $\partial\mu/\partial\theta = 0$, for real
values of $\theta$ in $0 < \theta < 1$.

The condition for two different concentrations to equilibrate is
again expressed by a non-algebraic equation which becomes very
complicated when local ordering and thermal motion are also taken
into account. However, there is a very simple expression of the
condition in terms of the graph of G/N against $\theta$. This is quite
independent of the model considered, involving only classical
thermodynamics. The Gibbs free energy G, which we have now introduced,
differs from the Helmholtz free energy F only by the addition of PV,
which is negligible in condensed systems for pressures near
atmospheric.

Suppose that $G = Ng(\theta)$ is known for given T, P. The chemical
potentials are defined by $\mu_A = \partial G/\partial N_A$, $\mu_B = \partial G/\partial N_B$ so that for any
small variation of composition

$$dG = \mu_A\,dN_A + \mu_B\,dN_B$$

Since the $\mu$ depend on $\theta$ and not N, except for very small N, we have
then

$$G = \mu_AN_A + \mu_BN_B$$

so necessarily

$$dG = \mu_A\,dN_A + \mu_B\,dN_B + N_A\,d\mu_A + N_B\,d\mu_B$$

and we must have the constraint

$$N_A\,d\mu_A + N_B\,d\mu_B = 0$$

In particular

$$\theta\frac{\partial \mu_A}{\partial \theta} + (1 - \theta)\frac{\partial \mu_B}{\partial \theta} = 0$$

Now

$$g = \theta\mu_A + (1 - \theta)\mu_B$$

i.e.

$$\frac{\partial g}{\partial \theta} = \mu_A - \mu_B + \theta\frac{\partial \mu_A}{\partial \theta} + (1 - \theta)\frac{\partial \mu_B}{\partial \theta} = \mu_A - \mu_B$$

Thus at any particular value of $\theta$,

$$g = \mu_B + \theta\frac{\partial g}{\partial \theta}$$

The condition that solutions at the same temperature and pressure,
but different concentrations $\theta_1$ and $\theta_2$, should equilibrate, is

$$\mu_A(1) = \mu_A(2) \quad \text{and} \quad \mu_B(1) = \mu_B(2)$$

It follows at once that

$$\frac{\partial g}{\partial \theta}(1) = \frac{\partial g}{\partial \theta}(2)$$

that is, that the slope of the graph of g against $\theta$ is the same at $\theta_1$ and
$\theta_2$. We call its value S.
Then

$$g(1) = \mu_B(1) + S\theta_1$$

$$g(2) = \mu_B(2) + S\theta_2$$

but $\mu_B(1) = \mu_B(2)$, so

$$g(2) - g(1) = S(\theta_2 - \theta_1)$$

that is, the line drawn through g(1) with slope S (tangential to the
curve, then) passes through g(2).

Only those solutions can be in mutual equilibrium, then, whose
representative points on this diagram (Fig. 8) are the points of
contact of common tangents to the curve of $g(\theta)$.
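For the symmetric Bragg-Williams form of g the construction can be carried out in a few lines. This is a sketch under stated assumptions, not a general routine: terms linear in $\theta$ are dropped from g (they change the slope S but not the points of contact), energies are in units of $kT_s$ with $z\Delta = 2$, $t = T/T_s$, and the function names are ours. With these simplifications g is symmetric about $\theta = \tfrac{1}{2}$, so the common tangent is horizontal and the points of contact are the two outer roots of $\partial g/\partial \theta = 0$; the lower root is found by bisection.

```python
import math


def g_mix(theta, t):
    """Symmetric part of g/(k*T_s): 2*theta*(1-theta) plus the mixing entropy term."""
    return (2.0 * theta * (1.0 - theta)
            + t * (theta * math.log(theta) + (1.0 - theta) * math.log(1.0 - theta)))


def g_slope(theta, t):
    """d(g_mix)/d(theta)."""
    return 2.0 * (1.0 - 2.0 * theta) + t * math.log(theta / (1.0 - theta))


def common_tangent_points(t):
    """Contact points (theta_1, theta_2) of the common tangent for 0 < t < 1."""
    if not 0.0 < t < 1.0:
        raise ValueError("two coexisting concentrations exist only for 0 < T/T_s < 1")
    lo = 1e-12                                  # slope is negative here unless t is tiny
    hi = 0.5 * (1.0 - math.sqrt(1.0 - t))       # slope reaches its positive maximum here
    if g_slope(lo, t) >= 0.0:
        raise ValueError("t is too small for the bracket used in this sketch")
    for _ in range(200):                        # plain bisection on the slope
        mid = 0.5 * (lo + hi)
        if g_slope(mid, t) < 0.0:
            lo = mid
        else:
            hi = mid
    theta_1 = 0.5 * (lo + hi)
    return theta_1, 1.0 - theta_1


if __name__ == "__main__":
    print(common_tangent_points(0.5))
```

For an unsymmetrical $g(\theta)$ the tangent is no longer horizontal, and both conditions, equal slope and a shared tangent line, must be solved together.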




Fig. 8 Construction to find the compositions at which two phases
in a two-component system will equilibrate at given temperature
(horizontal axis: atomic fraction)

The argument applies even if the equilibrating solutions differ in
structure, so that $g(\theta)$ is continuous only in discrete sections.

Lattice models of the liquid state 1 

The fact that critical phenomena on surfaces and in solid solution 
can be represented in terms of the lattice gas makes it tempting to 
construct some similar model of the liquid state. In this state, how- 
ever, there is no immediate guide to the choice of lattice, since no 
regular geometrical structure survives melting. 

The fundamental property which limits the mesh of the lattice in 
the method we have been using is that one lattice point may be 
occupied by only one molecule: immediately neighbouring lattice 
points may be occupied at the same time. If we do not wish to embark 
on a new calculation of configuration statistics, then, the separation 
of nearest neighbour lattice points must not be less than a molecular 
diameter. 

There is already a fair difference between a molecular diameter 
estimated from the density of the crystal near absolute zero and the 
density near the melting point. Typical materials expand by about 
10% in volume over this range, and by about another 10% on melt- 
ing. In any event, the maximum reasonable density of lattice points 
will correspond to close packing at a separation equal to the inter- 
molecular distance at zero temperature. 

In the original version of Lennard-Jones and Devonshire, the 
number of lattice points was taken equal to the number of molecules, 
and each molecule was regarded as moving in a cell of volume equal 
to the average volume per molecule. The potential field in which it 
moved was estimated by supposing the cells to be in close-packing, 
and setting each neighbouring molecule at the centre of its cell — as in 
the Einstein model of a close-packed crystal. 

Problem: Assume cubic close-packing, and a Lennard-Jones
potential such as $\phi(r) = ar^{-12} - br^{-6}$, for the interaction of two molecules.
At what interatomic distance does the potential energy of a
molecule at the centre of its cell change character from a minimum
to a maximum?

Hint: Since the only ellipsoid of cubic symmetry is a sphere, you
need only consider displacements along a symmetry axis chosen for
your convenience.

1 We shall follow certain developments of a method originally due to J. E.
Lennard-Jones and A. F. Devonshire (Proc. Roy. Soc. A170, p. 464, 1939, and
earlier) which are summarized and extended by J. S. Rowlinson and C. F.
Curtiss (J. Chem. Phys. 19, p. 1519, 1951).

At liquid densities the potential field near the centre of the cell is 
nearly a harmonic oscillator potential, and the model is identical 
with the Einstein picture of a crystal. At much lower densities, each 
molecule is practically uninfluenced by its neighbours, and its parti- 
tion function is nearly that for perfectly free motion through the 
volume of its cell. 
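
The curvature at the cell centre, which decides between the harmonic and the free-motion pictures, can be estimated numerically. The sketch below assumes that all neighbours sit exactly at fcc lattice sites and uses the fact that, by cubic symmetry, the curvature along any axis is one third of the summed Laplacian of the pair potential. The function names and the truncation of the lattice sum (`nmax`) are illustrative choices, not part of the original treatment.

```python
import itertools

def fcc_reduced_distances(nmax=4):
    """Distances from one fcc site to the others, in units of the
    nearest-neighbour spacing d.

    fcc sites are integer triples (i, j, k) with i + j + k even, measured in
    half cube edges; the nearest-neighbour distance is then sqrt(2).
    """
    dists = []
    for i, j, k in itertools.product(range(-nmax, nmax + 1), repeat=3):
        if (i, j, k) != (0, 0, 0) and (i + j + k) % 2 == 0:
            dists.append(((i * i + j * j + k * k) / 2.0) ** 0.5)
    return dists

def critical_spacing(dists):
    """Spacing d, in units of the pair-potential minimum r0, at which the
    cell-centre curvature changes sign.

    For phi = a r^-12 - b r^-6 the Laplacian is 132 a r^-14 - 30 b r^-8.
    With r0^6 = 2a/b the summed Laplacian vanishes at
    (d/r0)^6 = 2.2 * S14 / S8, where Sn is the sum of s^-n over neighbours.
    """
    s14 = sum(s ** -14 for s in dists)
    s8 = sum(s ** -8 for s in dists)
    return (2.2 * s14 / s8) ** (1.0 / 6.0)

nearest_only = critical_spacing([1.0] * 12)          # twelve nearest neighbours only
with_shells = critical_spacing(fcc_reduced_distances())
```

For spacings below the returned value the centre is a minimum (the harmonic regime); above it the centre becomes a maximum. Including more distant shells lowers the critical spacing slightly, because the attractive $r^{-6}$ term converges more slowly than the repulsive one.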

However, there is a problem about the entropy. In discussing the 
Einstein crystal model previously, we had no need to consider the 
identity of molecules, because what is located at each lattice point is 
not a particular molecule but a state of excitation. Alternatively, if 
we consider the molecules labelled there are $N!$ configurations for $N$
distinct molecules on $N$ sites, and this factor is just cancelled by the
$N!$ permutations of the molecules among themselves.

When a system is expanded, in the cell model, the same consideration
leads us to write for the partition function of $N$ non-interacting
molecules in total volume $V$ the expression

$$\left(\frac{V}{N\Lambda^3}\right)^N$$

which is much less than the true value which we have previously
shown to be

$$\frac{1}{N!}\left(\frac{V}{\Lambda^3}\right)^N$$

The error is a factor of $e^{-N}$, which implies an underestimate of the
entropy by just $k$ per molecule.
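
The size of this factor follows from Stirling's approximation. As a short worked check (the symbol $\Delta S$ for the entropy deficit is introduced here only for illustration):

```latex
\[
\frac{(V/N\Lambda^3)^N}{(V/\Lambda^3)^N/N!} = \frac{N!}{N^N}
\approx \sqrt{2\pi N}\,e^{-N},
\qquad
\Delta S = k\ln\frac{N^N}{N!} \approx Nk - \tfrac{1}{2}k\ln(2\pi N) \approx Nk .
\]
```

The logarithmic correction is negligible for macroscopic $N$, so the deficit is $k$ per molecule to the accuracy that matters here.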

This extra entropy, which any correct theory must build in, in 
suitable stages, along the transformation from crystal to vapour, is 
often called the communal entropy, and is extensively discussed in 
the earlier literature. Eyring suggested that while the simple cell 
model failed to account for melting, and therefore could not be right 
for both crystal and liquid, the cell model with the communal 
entropy arbitrarily added might be a good model for the liquid. The 
fact that the entropies of melting of several simple liquids are be- 
tween k and 2k per molecule does make this seem reasonable. 

Lennard-Jones and Devonshire constructed a model for the melting
of a crystal which brought in an entropy of fusion corresponding
to disorder of the molecules over a larger number of sites.

To have a well-defined model, they considered the 'octahedral'
interstitial sites in a face-centred cubic crystal to be added as a $\beta$-class
to the $\alpha$ or normal sites. These interstitial sites relate to the normal
sites exactly as the sodium ion positions in rock-salt relate to the
chlorine ion positions, and are equal in number to them.

This system can be treated to a good approximation by the quasi- 
chemical method, and gives a good value for the entropy change and 
the change of volume on melting if the ordering energy is chosen so 
as to give the correct melting point. The distribution of molecules is 
nearly completely randomized above the melting point, and the corresponding
change, $k \ln 2$, of configurational entropy accounts for
about half the entropy of fusion. 

A crystal at finite temperature already has some of its regular 
lattice sites unoccupied. In many substances the equilibrium number 
of interstitial sites occupied is much less than the number of regular 
sites vacant, so that the total number of regular sites, both occupied 
and vacant, increases with rising temperature. One should expect 
correspondingly to have vacant cells in the liquid model. 

Since, however, the volume expansion from zero temperature 
through the melting point is only of the order of 20%, the number 
of cells in the liquid near the melting point can only be about 20% 
more than the number of molecules, rather than 100% more as 
Lennard-Jones and Devonshire require. These authors suggest that
many vacant cells will shrink in the same sort of way that thin cracks 
in a crystal collapse, their walls drawing together, but it is far from 
clear that empty cells can shrink without disappearing. 

The comparative success of their model in describing liquid proper- 
ties suggests indeed that the number of distinct configurations of a 
liquid near its melting point may be equivalent to the number of 
arrangements of the molecules on a lattice with twice their number of 
sites. 

The question immediately arises whether distinct irregular con- 
figurations of the liquid can actually be mapped as distinct configura- 
tions on a regular lattice of sufficiently many points; in particular, 
if it can be done in such a way that nearest neighbours remain nearest 
neighbours in every instance. The answer to the particular question is 
almost certainly no, for the reason that the random structure prob- 
ably produces such things as five-membered rings in more profusion 
than any crystalline structure could do. 

The more general question leads to an exploration of statistical 
geometry, which is at present a largely experimental science. Studies 
of random packings made by J. D. Bernal and others reveal a number 
of interesting features which are best described in terms of a certain 
'natural' choice of cells. 

At any instant, there is a region surrounding each molecule which
belongs to that molecule, in the sense that points in the region are 
nearer to the centre of gravity of the given molecule than to that of 
any other molecule. This region is a polyhedron known as the 
Voronoi polyhedron, and is obtained by the following construction. 
We join the centre of the given molecule to the centre of its nearest 
neighbour and construct the plane which is the perpendicular bi- 
sector of that straight line. The operation is repeated for each of the 
less near neighbours in turn, until a closed polyhedron is formed. It 
may still be possible to cut off some of its corners by continuing the 
construction, so we go on until we reach a neighbour whose distance 
from the original centre is more than twice the distance from that 
centre of the most distant point in the smallest closed polyhedron, 
enclosing the centre, formed by the planes already drawn. The set of 
Voronoi polyhedra fills the whole space available to the system. 
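
The construction can be checked numerically. The sketch below is a brute-force illustration, not the stopping procedure described above: it tests a point against the perpendicular bisector planes of all other centres, and compares the result with the nearest-centre definition of the cell. The random centres and all function names are assumptions made for the example.

```python
import random

def nearest_centre(x, centres):
    """Index of the centre closest to point x (the defining property of the cell)."""
    return min(range(len(centres)),
               key=lambda i: sum((a - b) ** 2 for a, b in zip(x, centres[i])))

def inside_by_bisectors(x, i, centres):
    """True if x lies on the near side of every perpendicular bisector plane
    drawn between centre i and each other centre."""
    ci = centres[i]
    for j, cj in enumerate(centres):
        if j == i:
            continue
        mid = [(a + b) / 2 for a, b in zip(ci, cj)]      # point on the bisector plane
        normal = [b - a for a, b in zip(ci, cj)]         # plane normal, pointing away from ci
        if sum((xa - m) * n for xa, m, n in zip(x, mid, normal)) > 0:
            return False
    return True

rng = random.Random(0)
centres = [[rng.random() for _ in range(3)] for _ in range(20)]
points = [[rng.random() for _ in range(3)] for _ in range(500)]
owners = [[i for i in range(len(centres)) if inside_by_bisectors(p, i, centres)]
          for p in points]
```

Every sample point falls in exactly one polyhedron, which is the statement that the set of Voronoi polyhedra fills the space without overlap.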

If then the Voronoi polyhedra are taken as the cells, in the sense of 
the statistical model, there are by definition no empty cells, and we 
lose the contribution to the configurational entropy from random
distribution over more cells than molecules. The counterbalancing 
feature is the distribution of types of Voronoi polyhedra. This dis- 
tribution cannot be entirely random, though D. H. Everett has shown 
that if one groups the polyhedra simply according to the number of 
faces, and estimates the entropy of random mixing of the groups of 
polyhedra in the proportions found by Bernal's model experiment, 
a fair value of the liquid entropy near the melting point is reproduced. 

Bernal and Finney, on the other hand, have shown that the 
internal energy computed directly from the model distribution of 
bond lengths gives a very good value for the heat of fusion, so the 
model is by this (rather weak) test as random as the real liquid. 
Further development in this direction depends very much at present 
on the use of electronic computers to simulate model experiments. 

Returning now to the problem of constructing a cell model, with 
vacant cells, for a liquid or dense gas, we should like to choose a cell 
size such that configurations with more than one molecule per cell 
are unlikely, while interactions between molecules in nearest-neigh- 
bour cells alone need to be considered in the statistics. These two 
requirements are mutually compatible only for short-range inter- 
molecular forces. A helpful test of consistency will be to carry 
through the calculation for an arbitrarily chosen cell volume and 
minimize the resulting free energy with respect to that cell volume as 
a variable. If the 'best' cell volume emerging from this satisfies the 
first two requirements, the method is at least not self-contradictory. 

If then we have $N_A$ molecules distributed over $N$ cells, we have an
Ising lattice gas, whose partition function in the quasi-chemical
approximation is the product of a translation factor, which we write
as $(v_f/\Lambda^3)^{N_A}$ where $v_f$ is a free volume, and a configuration factor
$\exp(-F_c/kT)$. This free energy $F_c$ will be of the form calculated
previously for a disordered lattice gas (see p. 119) only if $(v_f/\Lambda^3)$ for
each cell depends on the occupation of neighbouring cells in a very
restricted way. As will be seen, some care is then needed in relating
$v_f$ to $\beta$.

At high densities, several cells around any given one will be occu- 
pied on average, and the average potential field, due to its neighbours, 
which each molecule experiences in its own cell will be nearly that of 
a spherical harmonic oscillator. It will then be natural to set 

$$v_f/\Lambda^3 = (kT/h\nu)^3$$

where $\nu$ depends on the density and (slightly) on the selected cell
volume. In the limit of very high density, $N = N_A$ and the partition
function is

$$\left(\frac{kT}{h\nu}\right)^{3N_A}\exp\left(-\frac{zN_AE_{AA}}{2kT}\right)$$

where $E_{AA}$, of course, also depends on density.

At very low densities, neighbour cells are on average empty and
$v_f$ is just the volume of the cell, $V/N$. Then the partition function is

$$\frac{N!}{N_A!\,(N-N_A)!}\left(\frac{V}{N\Lambda^3}\right)^{N_A} \approx \frac{1}{N_A!}\left(\frac{V}{\Lambda^3}\right)^{N_A}$$

for $N \gg N_A$.

This method, then, brings in the communal entropy smoothly as
density decreases. In the ideal gas limit, it will give the correct partition
function independently of the size chosen for the cell, provided
only that the cell size does not go to infinity with $V$, for a given value
of $N_A$. 1
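
How smoothly the communal entropy enters can be seen by comparing the cell-model partition function for non-interacting molecules with the ideal-gas value as the number of cells per molecule grows. This is a minimal numerical sketch; the function name and the sample ratios are illustrative assumptions.

```python
import math

def log_ratio_per_molecule(n_cells, n_mol):
    """ln(Z_cell / Z_ideal) / N_A for non-interacting molecules.

    Z_cell  = N! / (N_A! (N - N_A)!) * (V / (N Lambda^3))^N_A
    Z_ideal = (V / Lambda^3)^N_A / N_A!
    so the ratio is N! / ((N - N_A)! N^N_A), independent of V and Lambda.
    """
    log_ratio = (math.lgamma(n_cells + 1) - math.lgamma(n_cells - n_mol + 1)
                 - n_mol * math.log(n_cells))
    return log_ratio / n_mol

n_mol = 10_000
# Keys are the number of cells per molecule, N / N_A.
deficits = {ratio: log_ratio_per_molecule(ratio * n_mol, n_mol)
            for ratio in (1, 2, 10, 1000)}
```

With one cell per molecule the deficit is close to $-1$ (the missing $k$ per molecule), and it shrinks steadily towards zero as the cells outnumber the molecules.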

At intermediate densities, different molecules have different sets of
neighbours, and we must consider how the free volume in one cell
depends on the occupation of the neighbour cells. Our guide in this
is the requirement that minimizing the constrained free energy for
given bond numbers, with respect to the bond numbers, must lead to
the form of partition function already quoted.

1 This is precisely what goes wrong in the original method of Lennard-Jones
and Devonshire. The cells, unlike the Voronoi polyhedra, have no physical
status: they are merely aids to calculation, and may be chosen of any size. But,
if we choose cells so large that at a given density we are likely to find more than
one molecule in one cell, the correspondence with the two-state Ising model
vanishes and all the statistics must be recalculated. This is very clearly brought
out by J. G. Kirkwood (J. Chem. Phys. 18, p. 380, 1950).

This can only happen if the $v_f$ factor for a given occupied cell
depends on the number $z(1-w)$ of occupied neighbours in just the
same way as the corresponding bond-energy factor $\exp[-\tfrac{1}{2}z(1-w)E_{AA}/kT]$.
This is guaranteed if $\ln v_f = wa_1 + (1-w)a_0$, a linear dependence
of $\ln v_f$ on $w$. Set $a_{1,0} = \ln j_{1,0}$.

One can now either force the expression to be correct at very high
or very low densities, by choosing $j_1 = v_f(1)$, $j_0 = v_f(0)$, or regard $j_0$
and $j_1$ as parameters to be adjusted so as to get $v_f$ right (by minimizing
the free energy) at the actual density under inspection.

Using the simpler approach, we see that $v_f(1)$ is the free volume
in a cell whose neighbours are all empty, so is just the volume of the
cell: $v_f(0)/\Lambda^3$, on the other hand, is the partition function of the single
particle in a cell all of whose neighbours are occupied. At liquid
densities near the melting point, this will be near enough $(kT/h\nu)^3$.

Problem: How will $\nu$ depend on $N$ at fixed density, for the Lennard-Jones
potential? For $w \neq 0$ or 1, the potential field is generally asymmetric.
Consider the isotropic approximation in which each neighbour
cell is occupied by $(1-w)$ of a molecule.

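We have now associated with each molecule with $z(1-w)$ neighbours
a factor in the partition function

$$\frac{j_0^{\,1-w}\,j_1^{\,w}}{\Lambda^3}\exp\left[-\frac{z(1-w)E_{AA}}{2kT}\right]
= \frac{j_1}{\Lambda^3}\exp\left\{-\frac{z(1-w)}{2kT}\left[E_{AA} + \frac{2kT}{z}\ln\left(\frac{j_1}{j_0}\right)\right]\right\}$$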

The total partition function is then 

^ ! ;^cxp( zNaEa * \ 

x Leo8+i)J L(i-0)(j3+i)J 

where $\beta$ is now given by

p = 1+40(1- mjjjoy'c-'**"" -i] 

corresponding to an increase of $E_{AA}$ by $(2kT/z)\ln(j_1/j_0)$.

It turns out that the developments of this model give an unsatisfactory
account of the critical region. It is suspected, though hardly
yet proved, that this is due to underestimate of the communal entropy 
at intermediate densities. 

Although the statistical mechanics of liquids cannot be further 
developed here, considerable progress has been made in the last 
twenty years both by numerical calculation with large high-speed 
computers and by analytical calculation with more powerful mathe- 
matical methods. There are still very few exact results known for 
three-dimensional systems, but the exact theorems which are known 
for systems in one and two dimensions provide stringent tests for 
the approximation schemes which are suggested for the treatment 
of liquids in three dimensions. 

J. A. Barker has developed an approximate theory of liquids, 
which is probably a nearly exact theory of liquid crystals, based on 
the exact partition function for a one-dimensional gas of interacting 
particles. 1 



1 A clear discussion is given in his book, Lattice theories of the liquid state 
(Pergamon, 1963). 

Long-range interactions: gravity and Coulomb forces

If the interactions between particles fall off sufficiently rapidly with 
distance, a sensible first approximation is obtained by considering 
only the interaction of each particle with the small group of its nearest 
neighbours — those particles whose Voronoi polyhedra have a face 
in common with that of the central particle. This is by no means true 
of interactions which fall off more slowly. The inverse square law of 
force gives rise to several interesting problems of this nature. 

Gravitational repulsion is unknown, and it is easy to see that in 
consequence the chemical potential of a substance at given concen- 
tration cannot be independent of the total mass of it present. 

The gravitational potential at the surface of a sphere of mass $m$
and radius $r$ is

$$-Gm/r$$

where $G$ is the gravitational constant, so the gain in energy of an
increment $dm$ added to it is

$$-(Gm/r)\,dm$$

If moreover $Gm/r^2$ is so small that the addition distorts neither
$m$ nor $dm$ (distortion due to weight and reaction), this energy gain is
the whole gravitational contribution to the chemical potential per
unit mass, which is then

$$-Gm/r$$

or, if the sphere is of uniform density $\rho$,

$$-4\pi\rho Gr^2/3$$

Since $G$ is exceedingly small compared with the constants of the
Van der Waals and valence forces, the gravitational potential per
molecule is comparable to $kT$, at terrestrial surface temperatures,
only for bodies of planetary mass. The internal pressures in planets,
and still more in stars, are so great that the $PV$ term in the Gibbs free
energy is not merely considerable but sometimes dominant in
determining the thermal behaviour.
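
To get a feeling for the scale involved, one can solve $(4\pi\rho Gr^2/3)\,m_{\text{molecule}} = kT$ for the radius of a uniform sphere. The density, molecular mass and temperature below are assumed illustrative inputs, not values from the text, and the function name is hypothetical.

```python
import math

G = 6.674e-11       # gravitational constant, m^3 kg^-1 s^-2
K_B = 1.381e-23     # Boltzmann constant, J/K
AMU = 1.661e-27     # atomic mass unit, kg

def radius_for_kT(density, molecular_mass_amu, temperature):
    """Radius r at which (4 pi rho G r^2 / 3) * m_molecule equals kT."""
    m = molecular_mass_amu * AMU
    return math.sqrt(3 * K_B * temperature / (4 * math.pi * density * G * m))

# Assumed inputs: rock-like density 3000 kg/m^3, 60 amu per molecule, 300 K.
r = radius_for_kT(3000.0, 60.0, 300.0)
mass = 4 * math.pi * 3000.0 * r ** 3 / 3
```

The result depends on the inverse square root of density and molecular mass, so lighter and less dense material pushes the radius up; in any case it lies far beyond laboratory scale.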

The general relation of pressure to density, in a spherical gravitat- 
ing body in hydrostatic equilibrium, is easy to find, for it simply 
states that the weight of an element of the body is balanced by the 
difference in pressure on its horizontal faces. The weight is deter- 
mined by the mass of the element and the mass of that part of the 
body which lies nearer to the centre. Thus 

$$\frac{dp}{dr} = -\frac{\rho G}{r^2}\int_0^r 4\pi r'^2\rho(r')\,dr'$$

where pressure $p$ and density $\rho$ are some functions of $r$. Obviously
$dp/dr = 0$ at $r = 0$.

For some purposes it is more transparent to rearrange and differentiate
this relation, giving

$$\frac{1}{r^2}\frac{d}{dr}\left(\frac{r^2}{\rho}\frac{dp}{dr}\right) = -4\pi G\rho$$

The condition $dp/dr = 0$ at $r = 0$, of which we lose track on differentiating,
must now be explicitly restated.

Further progress depends on finding another usable relation between
$p$ and $\rho$. This will be amply supplied by the equation of state
if we know the temperature as a function of $r$. The temperature
depends in general on the mechanisms of heat generation and heat
escape (for example, a planet could well have a large isothermal core
surrounded by a layer of radioactive rocks whose energy output
supplied the whole of the heat flow to the external surface) and
these depend in turn on, at least, the local temperature and density,
leading to a rather nasty group of interlocking relations. 1

As a short-cut we may postulate a convenient algebraic relation
between $p$ and $\rho$, perhaps something like an adiabatic equation of
state. The true adiabatic relation would have one advantage, that in a
region free of energy sources this temperature distribution would be
undisturbed by radial currents. Matter flowing slowly in or out would
by adiabatic compression or expansion adjust to the temperature
appropriate to its level. Such currents would not then disturb the
hydrostatic equilibrium by upsetting the pressure-density relation
which that equilibrium requires.

1 A discussion of such questions, though not a very elementary one, is given
in S. Chandrasekhar's Introduction to the Study of Stellar Structure.

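Trial of $p = P\rho^\gamma$ (with $P$ a constant) leads to

$$\frac{1}{r^2}\frac{d}{dr}\left(r^2\rho^{\gamma-2}\frac{d\rho}{dr}\right) = -\frac{4\pi G\rho}{\gamma P}$$

with $d\rho/dr = 0$ at $r = 0$. The arbitrary choice of $\gamma = 2$ leads to the simple
result

$$\rho = Ar^{-1}\sin kr$$

where $k^2 = 2\pi G/P$.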

It would evidently be wise to keep the surface of our body inside
$R_0$, where $kR_0 = \pi$. Setting the surface at $R_0$, with $p = \rho = 0$ at all
points outside, meets the conditions so far imposed and leaves $A$,
and so the total mass, completely unspecified. We have therefore a
set of equilibrium bodies all of the same radius and of all conceivable
masses.

One might conclude that the proposed equation, at least with
$\gamma = 2$, was thus shown to be inapplicable.
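
The peculiarity of the $\gamma = 2$ case can be confirmed numerically. The sketch below, in arbitrary units with an assumed value of $k$, checks by finite differences that $\rho = A\sin(kr)/r$ satisfies $(1/r^2)\,d/dr(r^2\,d\rho/dr) = -k^2\rho$, and integrates the mass out to $R_0 = \pi/k$ for several amplitudes. The function names and step sizes are illustrative.

```python
import math

def density(r, A, k):
    """rho(r) = A sin(kr)/r, with the limiting value A*k at the centre."""
    return A * k if r == 0 else A * math.sin(k * r) / r

def residual(r, A, k, h=1e-4):
    """(1/r^2) d/dr (r^2 d rho/dr) + k^2 rho by central differences; should be near 0."""
    def flux(x):
        return x * x * (density(x + h, A, k) - density(x - h, A, k)) / (2 * h)
    lap = (flux(r + h) - flux(r - h)) / (2 * h * r * r)
    return lap + k * k * density(r, A, k)

def total_mass(A, k, steps=20000):
    """Integral of 4 pi r^2 rho from 0 to R0 = pi/k by the midpoint rule."""
    R0 = math.pi / k
    dr = R0 / steps
    return sum(4 * math.pi * ((i + 0.5) * dr) ** 2
               * density((i + 0.5) * dr, A, k) * dr
               for i in range(steps))

k = 2.0                                   # arbitrary units; k^2 = 2 pi G / P
masses = [total_mass(A, k) for A in (1.0, 2.0, 5.0)]
```

The integral evaluates to $4\pi^2A/k^2$, so the mass scales linearly with the free amplitude $A$ while the radius $R_0$ stays fixed by $k$ alone.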

Problem: Discuss the use of $p = P(\rho^2 - \rho_0^2)$.

Certainly lower values of y are more plausible. However, they lead 
to almost equally remarkable conclusions. For example, the large 
class of dense stars known as the white dwarves is believed to be cold 
in the sense that the equation of state is essentially independent of 
the temperature. 

R. H. Fowler first suggested that the density in the interior of these 
stars is so high that the extended structure of electron shells, which 
characterizes atoms in the normal states of matter, is wholly broken 
down. This dense matter consists of atomic nuclei floating in a Fermi 
gas of essentially free electrons. It is effectively cold because the 
Fermi energy is much larger than kT in the star. 

Problem: A typical white dwarf might have a mass of $10^{30}$ kg, a
radius of $10^7$ m and a surface temperature of $10^4$ K. What would the
internal temperature $T_i$ have to be if $kT_i$ were to be equal to the
Fermi energy of electrons in cold matter at the average density of
this star? (Assume that the atomic nuclei present contain on average
equal numbers of protons and neutrons.)
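
A rough numerical sketch of this estimate follows. It assumes the non-relativistic free-electron formula $E_F = (\hbar^2/2m)(3\pi^2n)^{2/3}$ and one electron per two nucleons; the function name and the rounded constants are choices made for the example.

```python
import math

HBAR = 1.0546e-34   # reduced Planck constant, J s
M_E = 9.109e-31     # electron mass, kg
AMU = 1.6605e-27    # atomic mass unit, kg
K_B = 1.3806e-23    # Boltzmann constant, J/K

def fermi_temperature(mass, radius, nucleons_per_electron=2.0):
    """E_F / k for a non-relativistic electron gas at the star's mean density."""
    volume = 4 * math.pi * radius ** 3 / 3
    n = mass / volume / (nucleons_per_electron * AMU)    # electrons per m^3
    e_fermi = HBAR ** 2 * (3 * math.pi ** 2 * n) ** (2 / 3) / (2 * M_E)
    return e_fermi / K_B

T_i = fermi_temperature(1e30, 1e7)
```

The resulting temperature exceeds the quoted surface temperature by several orders of magnitude, which is the sense in which the matter is "cold". Since the Fermi energy found this way is not negligible beside the electron rest energy, the non-relativistic formula is only a first estimate.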

The pressure of a cold electron gas is easily seen (e.g. from the Fermi energy varying as $n^{2/3}$)
to be proportional to the 5/3 power of the density, so long as the
Fermi energy is much less than the rest energy of an electron. In the
extreme relativistic limit all particles look like photons, with energy
just equal to the size of the momentum multiplied by the speed of
light; the Fermi energy becomes proportional to $n^{1/3}$ and $\gamma$ drops to 4/3.

Even with $\gamma = 5/3$, it turns out that the radius of a white dwarf
should decrease with increasing mass. The cold plasma is too compressible
to support itself against its own weight, when a sufficient
mass is present. Landau showed as early as 1932 (see, for example,
Chandrasekhar's book already quoted) that the radius of the white
dwarf should go to zero at a mass not very much larger than that of
the sun, and much less than the mass of many known normal stars.
A point particle of solar mass would indeed be an interesting object.

At least one phase change must intervene as the density increases.
The free neutron moving in ordinary matter is a radioactive particle,
decaying into a proton, an electron and an antineutrino with a lifetime
of about 12 minutes.

$$n \rightarrow p + e + \bar{\nu}$$

The energy release in the reaction is 780 keV.

If the neutron moves in a Fermi sea of electrons of depth greater 
than 780 keV (well over the electron rest energy, as it happens) this 
decay is inhibited, for there is not enough energy to raise the product 
electron to the Fermi surface. In fact, protons and electrons now 
revert to neutrons and neutrinos by the reaction 

$$p + e \rightarrow n + \nu$$

Problem: At what numerical density of electrons is the Fermi 
energy equal to 780 keV? The distribution in momentum is the same 
as in the non-relativistic case, but we must take the total energy E 
of each electron (Fermi energy plus rest energy of the electron) as 
given in terms of the momentum p and rest-mass m by the general 
relation 

$$E^2 = p^2c^2 + m^2c^4$$
where c is the speed of light. 
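
One way to set this calculation up numerically is sketched below. It takes the momentum distribution as in the non-relativistic case, $n = p_F^3/(3\pi^2\hbar^3)$, and obtains $p_F$ from the relativistic energy relation; the function name and the rounded constants are assumptions of the example.

```python
import math

HBAR = 1.0546e-34      # reduced Planck constant, J s
C = 2.998e8            # speed of light, m/s
KEV = 1.602e-16        # joules per keV
M_E_C2 = 511.0         # electron rest energy, keV

def electron_density(fermi_energy_kev):
    """Number density n = p_F^3 / (3 pi^2 hbar^3), with p_F from
    E^2 = p^2 c^2 + m^2 c^4 and E = Fermi energy + rest energy."""
    total = fermi_energy_kev + M_E_C2
    pc = math.sqrt(total ** 2 - M_E_C2 ** 2) * KEV     # p_F * c in joules
    p_fermi = pc / C
    return p_fermi ** 3 / (3 * math.pi ** 2 * HBAR ** 3)

n_threshold = electron_density(780.0)    # electrons per m^3
```

For small Fermi energies the same function reduces to the non-relativistic result $n = (2mE_F)^{3/2}/(3\pi^2\hbar^3)$, which gives a convenient consistency check.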

At very high density, then, the stable state of matter consists of a
Fermi sea of neutrons, with just enough electrons (and the corresponding
protons) to inhibit the neutron $\beta$-decay. This is still a
Fermi fluid, and the maximum mass of a neutron star is still little
more than that of the sun. Moreover, there is no experimental evidence
for the existence of neutron stars, other than the appearance,
in the pulsating radio stars, of bodies which are both too energetic and
too small to be white dwarfs of the more usual sort.

The problem of what possible fate awaits massive stars when their 
reserves of thermonuclear fuel run out must be left to the speculations 
of the astrophysicists. The cold 'plasma', well established as the state 
of matter in the white dwarfs, leads our attention to the plasma 
state at lower concentrations, where the charged particles are non- 
degenerate at accessible temperatures. 

Debye-Hückel theory of electrolytes

Much of both technical chemistry and biochemistry is concerned 
with the behaviour in water of substances which are seen, e.g. from 
electrolytic phenomena, to dissociate as they dissolve into electrically 
charged particles. The overall electrical neutrality of the solution 
avoids the size effects which are quite inevitable in gravitation, and 
the thermodynamic potentials depend only on the concentrations, 
the pressure and the temperature. 

A solution containing only two sorts of ion might be described as 
a three-component lattice gas. In view of the long-range interactions, 
the count of configurations of approximately equal energy can not 
be handled by the methods of the last chapter, and we start with a 
different set of approximations, emphasizing first of all the electro- 
static interaction energy. 

This electrostatic interaction separates into two parts, interaction 
of ions with solvent, and of ions with ions. Good ionizing solvents, in 
which a salt like sodium chloride will dissolve freely as separate 
sodium and chlorine ions rather than as the diatomic sodium chloride 
molecule, must respond to the ionic charges so as to lower their free 
energy in solution. Solvents such as water and ammonia have large 
molecular dipole moments and correspondingly large dielectric 
constants. 

The large dielectric constant not only lowers the energy of solution 
of a given ion, but reduces the effective interaction of ions with each 
other. There is a complication because the electric field close to an 
ion is so strong that dielectric saturation occurs. For this reason 
among others, the elementary treatment to be described, following 
the early work of Debye and Hückel, is applicable only to very
dilute solutions. 

Ionic atmosphere near a fixed charge 

We consider first the electric field and ionic concentrations in equilibrium
in the neighbourhood of a fixed point charge $Q$, supposing
that the ions have charges $z_1q$ and $z_2q$, where $q$ is the charge of a
proton, and the medium has dielectric constant $\epsilon$. Let the numerical
concentrations of the ions at distance $r$ from $Q$ be $n_1(r)$ and $n_2(r)$
respectively.

The charge within a sphere of radius $R$ centred on $Q$ is evidently

$$Q + q\int_0^R (z_1n_1 + z_2n_2)\,4\pi r^2\,dr = 4\pi R^2\,\mathcal{D}(R) \qquad (A)$$

where $\mathcal{D}$ is the radial electric displacement. The electric intensity
$E = \mathcal{D}/\epsilon$, and the electric potential at $R$ with respect to a zero at
infinite distance is

$$\phi(R) = \int_R^\infty E(r)\,dr$$

so

$$\mathcal{D}(R) = -\epsilon\,d\phi/dR$$

Differentiating equation (A) with respect to $R$ gives then

$$\frac{1}{R^2}\frac{d}{dR}\left(R^2\frac{d\phi}{dR}\right) = -\frac{q}{\epsilon}(z_1n_1 + z_2n_2)$$


The concentrations $n_1$ and $n_2$ depend on $\phi$, being in fact determined
by the constancy of the chemical potentials $\mu_1$ and $\mu_2$ of the ions
throughout the system. The potential $\mu_1$, for example, contains a
term $z_1q\phi$. If we assume that it contains no other terms dependent
on $\phi$ or $E$ we are ignoring various effects of polarization or electrostriction,
which can be comparable with the original term only for
particular systems and very close to the central charge.

In addition, we make the strongest simplifying assumption about
the rest of the $\mu$, namely, that apart from the electrostatic interaction
the ions move as if perfectly independent of each other. Then we
may write

$$n_1 = N_1e^{-\beta z_1q\phi}, \qquad n_2 = N_2e^{-\beta z_2q\phi}$$

where $z_1N_1 + z_2N_2 = 0$ expresses the electrical neutrality of the solution
far from the perturbing charge $Q$.

It should be noted that this takes only partial account of the mutual
action of the ions, precisely because of the assumption that both sorts
of ions see the same potential $\phi$. The charge pair, $Q$ with $z_1q$ at distance
$r$, has a different equilibrium atmosphere from $Q$ with $z_2q$ at
distance $r$. To pursue this point leads to substantial improvement in
the theory at the cost of substantial mathematical exertion.

We again take the primrose path of linearization, with the additional
assumption that $\beta zq\phi$ is in each case much less than unity, so
that

$$n_1 = N_1(1 - \beta z_1q\phi), \qquad n_2 = N_2(1 - \beta z_2q\phi)$$

and, using the neutrality condition,

$$\frac{1}{R^2}\frac{d}{dR}\left(R^2\frac{d\phi}{dR}\right) = \frac{\beta q^2}{\epsilon}(N_1z_1^2 + N_2z_2^2)\,\phi$$

The solution of this last equation which goes to zero as $R$ goes to
infinity is $\phi = f\exp(-R/D)/R$, where $D$, which is called the Debye
length, is given by

$$D^2 = \frac{\epsilon kT}{q^2(N_1z_1^2 + N_2z_2^2)}$$

The decaying exponential expresses the screening of the Coulomb
force by the ionic atmosphere. If we set the total ionic concentration
$N_1 + N_2 = N$, then for given $z_1$, $z_2$

$$D \propto (\epsilon T/N)^{1/2}$$

The mean separation of neighbouring ions is

$$L \sim N^{-1/3}$$

so for sufficiently dilute solutions we always have $D \gg L$. In this case
there are many ions within a Debye length from any given point, and
the discussion in terms of a smoothly varying average potential is
likely to be justified.

Problem: What order of concentration would give D=L for 
monovalent ions in water at room temperature ? 
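
The Debye length formula is easy to evaluate. In the sketch below $\epsilon$ is taken as the absolute permittivity $\epsilon_r\epsilon_0$, and the relative permittivity of water, the temperature and the concentration are assumed illustrative inputs. The helper `crossover_density` is a hypothetical name for the density at which $D = N^{-1/3}$ for monovalent ions, taking $L = N^{-1/3}$ exactly.

```python
import math

EPS0 = 8.854e-12        # vacuum permittivity, F/m
K_B = 1.381e-23         # Boltzmann constant, J/K
Q_E = 1.602e-19         # proton charge, C
N_AVOGADRO = 6.022e23   # per mole

def debye_length(densities, valences, eps_r, temperature):
    """D = sqrt(eps kT / (q^2 sum_i N_i z_i^2)), number densities in m^-3."""
    s = sum(n * z * z for n, z in zip(densities, valences))
    return math.sqrt(eps_r * EPS0 * K_B * temperature / (Q_E ** 2 * s))

def crossover_density(eps_r, temperature):
    """Total ion density N at which D = N^(-1/3) for monovalent ions.

    For |z| = 1 the sum over species is just N, so D^2 = c / N with
    c = eps kT / q^2, and D = N^(-1/3) gives N = c^3.
    """
    c = eps_r * EPS0 * K_B * temperature / Q_E ** 2
    return c ** 3

# Assumed inputs: relative permittivity 78.4, 298 K, 1 mol/m^3 of a 1:1 salt.
n_ion = 1.0 * N_AVOGADRO
D = debye_length([n_ion, n_ion], [1, -1], 78.4, 298.0)
N_cross = crossover_density(78.4, 298.0)
```

Because $D$ falls as $N^{-1/2}$ while $L$ falls only as $N^{-1/3}$, the condition $D \gg L$ holds on the dilute side of the crossover and fails on the concentrated side.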

The charge $Q$ does not affect the form of the screened potential.
It does of course determine the amplitude $f$, for the electric displacement
at some sufficiently small $R$ must be $Q/4\pi R^2$, provided we can
find a sphere which contains all of $Q$ and is never entered by other
charges. If also the screened Coulomb potential is valid down to this
surface, say of radius $b$, we have outside $b$

$$\mathcal{D} = -\epsilon\frac{d\phi}{dR} = \epsilon f\,\frac{e^{-R/D}}{R}\left(\frac{1}{D} + \frac{1}{R}\right)$$

so

$$\epsilon f\,\frac{e^{-b/D}}{b}\left(\frac{1}{D} + \frac{1}{b}\right) = \frac{Q}{4\pi b^2}$$

i.e.

$$f = \frac{Q\,e^{b/D}\,D}{4\pi\epsilon(b + D)}$$

Then

$$\phi = \frac{Q}{4\pi\epsilon R}\,\frac{D\,e^{(b-R)/D}}{b + D}$$

The part of this due to the presence of $Q$ itself and the solvent is
$Q/4\pi\epsilon R$, so the contribution from the ionic atmosphere is

$$\frac{Q}{4\pi\epsilon R}\left[\frac{D\,e^{(b-R)/D}}{b + D} - 1\right]$$

In particular, the potential at (and inside) $b$, due to the surrounding
ions is $-Q/[4\pi\epsilon(b + D)]$, which is hardly dependent on $b$ for
$b \ll D$.

Problem: There is a net space-charge round Q. Show that the total 
charge of the ionic atmosphere, with the assumptions made above, 
is -Q. 
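
A numerical check of this statement is sketched below. It assumes the linearized space-charge density $q(z_1n_1 + z_2n_2) = -(\epsilon/D^2)\phi$ outside $b$ and integrates it by the midpoint rule; the units, the cutoff radius and the function name are arbitrary choices for the example.

```python
import math

def atmosphere_charge(Q, b, D, eps, r_max_in_D=40.0, steps=200000):
    """Integrate the linearized space-charge density from b outwards.

    phi = f exp(-R/D)/R with f = Q D exp(b/D) / (4 pi eps (b + D));
    the linearized charge density is -(eps / D^2) * phi.
    """
    f = Q * D * math.exp(b / D) / (4 * math.pi * eps * (b + D))
    r_max = b + r_max_in_D * D            # the tail beyond this is negligible
    dr = (r_max - b) / steps
    total = 0.0
    for i in range(steps):
        r = b + (i + 0.5) * dr            # midpoint of the shell
        phi = f * math.exp(-r / D) / r
        total += 4 * math.pi * r * r * (-(eps / D ** 2) * phi) * dr
    return total

charge = atmosphere_charge(Q=1.0, b=0.3, D=1.0, eps=1.0)
```

The integral can also be done in closed form, since $\int_b^\infty R\,e^{-R/D}\,dR = D(b + D)e^{-b/D}$, which cancels the factors in $f$ and leaves exactly $-Q$.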

The interesting application of these results is naturally the situation
where the charge $Q$ is itself an ion of radius $b$ and charge $zq$. The self-energy
of such an ion in air is of the order of $(zq)^2/8\pi b\epsilon_0$, being
exactly this if the charge is uniformly spread on the surface of a
sphere of radius $b$. The corresponding self-energy in the dielectric is
simply obtained by replacing the dielectric constant of the vacuum
by that of the medium, so the self-energy drops, when the ion passes
into solution, by the amount

$$\frac{(zq)^2}{8\pi b}\left(\frac{1}{\epsilon_0} - \frac{1}{\epsilon}\right)$$



The electrostatic binding energy between two ions in a diatomic
molecule in vacuum will be

$$\frac{|z_1z_2|\,q^2}{4\pi(b_1 + b_2)\epsilon_0}$$

so there is no difficulty in regaining all of this on dissolving the
separate ions in a medium with high $\epsilon$.

For each ion, there is a large term in its free energy

$$\frac{(zq)^2}{8\pi b}\left(\frac{1}{\epsilon} - \frac{1}{\epsilon_0}\right)$$

independent of concentration and dependent on temperature only
through the temperature dependence of the dielectric constant. We
proceed with the identification of the concentration-dependent term.

Reversible charging and electrical free energy 
The electrical potential across a condenser of capacitance C, at 
charge Q, is V= QIC. Addition of an element of charge dig involves 
work VdQ = QdQ/C, so if C is constant through the charging 
process, the stored energy is g 2 /2C or iCV 2 . 

C is the product of a geometrical factor /and the dielectric con- 
stant of the insulator, e. For a conducting sphere of radius b, 
C=4nbe. Since e varies with temperature, rapid charging produces a 
change in temperature of the dielectric. This change is analogous to 
the heating of a gas in reversible adiabatic compression, and is quite 
independent of any dissipative process which may occur. 

If the condenser is thermally connected to a thermostat, and 
charging is so slow that the dielectric remains at constant tempera- 
ture, a finite heat transfer accompanies the isothermal charging. 
The capacitance C remains constant, so the electrical work done is
$Q^2/2C$, which is evidently stored as free energy in the dielectric.

The recipe for identifying the electrical free energy associated with a 
charge distribution is, then, to build up the charge reversibly from zero 
under isothermal conditions and evaluate the work done in the process. 
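
As a minimal numerical sketch of this recipe, the following Python fragment builds up the charge on a conducting sphere in small steps and sums the work V dQ. The radius, the final charge and the number of steps are illustrative assumptions, and the sphere is taken to be in vacuum so that the dielectric constant is the vacuum value.

```python
import math

def charging_work(capacitance, final_charge, steps=100_000):
    """Sum V dQ over small charge increments (midpoint rule), with C held constant."""
    dq = final_charge / steps
    work = 0.0
    for k in range(steps):
        q_mid = (k + 0.5) * dq              # charge already present, mid-step
        work += (q_mid / capacitance) * dq  # V dQ with V = Q/C
    return work

EPS0 = 8.8541878128e-12   # vacuum dielectric constant, F/m
b = 1.0e-2                # sphere radius in metres (illustrative)
C = 4 * math.pi * EPS0 * b
Q = 1.0e-9                # final charge in coulombs (illustrative)

W = charging_work(C, Q)
print(W, Q ** 2 / (2 * C))
```

The summed work agrees with $Q^2/2C$ because C is held constant throughout the isothermal charging.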

In an ionic solution there are two such imagined processes of 
dominant interest, leading respectively to the chemical potential 
and the total free energy. 

In the first process, due to Güntelberg, we imagine all ions but one
to be already present and bearing their full charge, so that D is
fixed at a constant value. The ion of particular interest has instead of
charge zq the charge $zq\xi$, where $\xi$ is to be slowly increased from 0 to 1.
So far as electrical interactions are concerned, this gives a mechanism 
for introducing an ion reversibly, without worrying about boundary 
effects. The charging parameter is an imaginary device, but violates 
only the law of conservation of charge, which is not part of thermo- 
dynamics: in fact, if we switch on simultaneously a small set of far 
separated ions of zero resultant charge, only the law of quantization 
of charge is infringed. 

At a given value of $\xi$, the potential at the ion's surface due to its
own charge is $zq\xi/4\pi b\epsilon$; the potential due to neighbouring ions is
$-zq\xi/[4\pi\epsilon(b+D)]$. Increasing $\xi$ by $d\xi$ adds charge $zq\,d\xi$ at the cost
of work

$$\frac{(zq)^2}{4\pi\epsilon}\left(\frac{1}{b}-\frac{1}{b+D}\right)\xi\,d\xi,$$

so the total work done in charging to $\xi = 1$ is

$$\frac{(zq)^2}{8\pi\epsilon b}-\frac{(zq)^2}{8\pi\epsilon(b+D)}.$$

The first term we have already mentioned as the self-energy. The 
second term is the ion interaction part of the chemical potential. 
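
The Güntelberg charging of a single ion at fixed D can be sketched numerically as follows. The function name and the numerical values (a univalent ion of radius 0.2 nm, a Debye length of 1 nm, and a dielectric constant eighty times the vacuum value, roughly that of water) are assumptions made only for illustration.

```python
import math

def guntelberg_work(z, q, eps, b, D, steps=10_000):
    """Work to charge one ion from xi = 0 to xi = 1 at fixed Debye length D."""
    prefactor = (z * q) ** 2 / (4 * math.pi * eps) * (1 / b - 1 / (b + D))
    dxi = 1.0 / steps
    # midpoint rule for the integral of xi d(xi) from 0 to 1
    return prefactor * sum((k + 0.5) * dxi * dxi for k in range(steps))

Q_E = 1.602176634e-19      # elementary charge, C
EPS0 = 8.8541878128e-12    # vacuum dielectric constant, F/m
eps = 80 * EPS0            # illustrative value for the solvent
b, D = 0.2e-9, 1.0e-9      # ion radius and Debye length in metres (illustrative)

w = guntelberg_work(1, Q_E, eps, b, D)
self_energy = Q_E ** 2 / (8 * math.pi * eps * b)
interaction = -Q_E ** 2 / (8 * math.pi * eps * (b + D))
print(w, self_energy + interaction)
```

The numerical work splits into the positive self-energy and the negative interaction term, the latter being the contribution to the chemical potential.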

Debye applied the charging process to all ions simultaneously, at
specified values of $N_1, N_2$. This gives the electrical part of the
Helmholtz free energy. The Debye length is now variable. When each
charge is multiplied by $\xi$, the Debye length becomes $D/\xi$.

The potential due to neighbouring ions is

$$-\frac{zq}{4\pi\epsilon}\,\frac{\xi^2}{b\xi+D},$$

so the work done in the charging process is

$$-\frac{(zq)^2}{4\pi\epsilon}\int_0^1\frac{\xi^2\,d\xi}{b\xi+D}
= -\frac{(zq)^2D^2}{4\pi\epsilon b^3}\int_0^{b/D}\frac{x^2\,dx}{x+1}
= -\frac{(zq)^2}{4\pi\epsilon D}\left(\frac{1}{3}-\frac{b}{4D}+\cdots\right).$$
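
The quality of the two-term expansion can be judged with a short sketch. It uses the elementary closed form $\int_0^a x^2\,dx/(1+x) = a^2/2 - a + \ln(1+a)$, with $a = b/D$; the work per ion is then $-(zq)^2/4\pi\epsilon D$ times the dimensionless factor computed below. The function names are hypothetical.

```python
import math

def debye_factor_exact(a):
    """(1/a**3) * integral from 0 to a of x**2/(1+x) dx, where a = b/D."""
    return (a * a / 2 - a + math.log1p(a)) / a ** 3

def debye_factor_series(a):
    """First two terms of the expansion: 1/3 - b/(4D)."""
    return 1 / 3 - a / 4

for a in (0.01, 0.1, 0.3):
    print(a, debye_factor_exact(a), debye_factor_series(a))
```

For small b/D the two agree to within the first neglected term of the alternating series, which is $a^2/5$.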

The free energy of ionic interaction, then, is in first approximation

$$F_{\rm int} = -\sum_i N_i\,\frac{(z_i q)^2}{12\pi\epsilon D}.$$



Problem: Verify that this statement is consistent with the chemical
potential derived by Güntelberg's approach.
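
A numerical check of this kind of consistency can be sketched as follows. It rests on two assumptions which should be compared with the definitions used earlier in the text: that the leading interaction free energy is $-\sum_i N_i (z_i q)^2/12\pi\epsilon D$, and that the Debye length has the usual form $D^2 = \epsilon kTV/\sum_i N_i (z_i q)^2$. Reduced units with $q = \epsilon = kTV = 1$ are used, and the ion numbers are illustrative.

```python
import math

def debye_length(N, z):
    """D in reduced units (q = eps = kT*V = 1): D**2 = 1 / sum(N_i z_i**2)."""
    return 1.0 / math.sqrt(sum(n * zi * zi for n, zi in zip(N, z)))

def f_int(N, z):
    """Assumed leading-order interaction free energy, -sum N_i z_i**2 / (12 pi D)."""
    D = debye_length(N, z)
    return -sum(n * zi * zi for n, zi in zip(N, z)) / (12 * math.pi * D)

def chemical_potential(N, z, i, h=1e-3):
    """Central-difference estimate of dF_int/dN_i at fixed temperature and volume."""
    up = list(N)
    up[i] += h
    down = list(N)
    down[i] -= h
    return (f_int(up, z) - f_int(down, z)) / (2 * h)

N = [1000.0, 500.0]   # illustrative numbers of the two ion species
z = [1, -2]           # valencies chosen so that the solution is neutral
D = debye_length(N, z)
for i in range(2):
    mu_numeric = chemical_potential(N, z, i)
    mu_guntelberg = -z[i] ** 2 / (8 * math.pi * D)   # b << D limit
    print(i, mu_numeric, mu_guntelberg)
```

Under these assumptions the derivative of the free energy reproduces the Güntelberg interaction term in the limit $b \ll D$, the factor 3/2 arising because D itself depends on the N.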

If D is expressed in terms of the $N_i$, we obtain

$$F_{\rm int} = -\frac{1}{12\pi}\,\frac{\left[\sum_i N_i (z_i q)^2\right]^{3/2}}{\epsilon^{3/2}\,(kTV)^{1/2}},$$

quite unlike the $N_m q^2/\epsilon$ which gives the order of magnitude of the
binding energy of the hydrated crystal, the limit of high
concentration and low temperature.

(1) Integrals leading to the $\zeta$-function of Riemann

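Various expectation values for the ideal gas of bosons or fermions
contain the integral

$$\int_0^\infty\frac{x^m\,dx}{e^x\pm1} = \int_0^\infty x^m e^{-x}\left(1\mp e^{-x}+e^{-2x}\mp e^{-3x}+\cdots\right)dx.$$

The term

$$\int_0^\infty x^m e^{-nx}\,dx = n^{-(m+1)}\int_0^\infty y^m e^{-y}\,dy = m!\,n^{-(m+1)},$$

so

$$\int_0^\infty\frac{x^m\,dx}{e^x\pm1} = m!\left(1\mp2^{-(m+1)}+3^{-(m+1)}\mp\cdots\right).$$

The series of positive terms defines Riemann's function $\zeta(m+1)$.
The alternating series is

$$\zeta(m+1)-2\left[2^{-(m+1)}+4^{-(m+1)}+\cdots\right] = (1-2^{-m})\,\zeta(m+1),$$

so

$$\int_0^\infty\frac{x^m\,dx}{e^x\pm1} = \left(1-2^{-m}\ \text{or}\ 1\right)m!\,\zeta(m+1),$$

the factor $1-2^{-m}$ belonging to the upper (fermion) sign and the factor 1
to the lower (boson) sign.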

The $\zeta$-function is fully tabulated in Tables of Functions by Jahnke
and Emde. A useful set of values is

n            3/2     2       5/2     3       7/2     4
$\zeta(n)$   2.612   1.645   1.341   1.202   1.127   1.082

It is interesting to note that for integral n

$$\zeta(2n) = \frac{B_n\,(2\pi)^{2n}}{2\,(2n)!},$$

where the first few Bernoulli numbers $B_n$ are

n        1       2       3       4
$B_n$    1/6     1/30    1/42    1/30
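
The results of this section lend themselves to a quick numerical check. The sketch below is not taken from the tables cited; it evaluates $\zeta(s)$ from a partial sum with an Euler-Maclaurin estimate of the tail, compares the boson and fermion integrals for m = 3 with $m!\,\zeta(m+1)$ and $(1-2^{-m})\,m!\,\zeta(m+1)$, and evaluates $\zeta(2n)$ from the Bernoulli numbers. The function names, the cut-off of the integrals at x = 60 and the step counts are assumptions.

```python
import math
from fractions import Fraction

def zeta(s, n_terms=1000):
    """Riemann zeta for s > 1: partial sum plus an Euler-Maclaurin tail estimate."""
    head = sum(k ** -s for k in range(1, n_terms))
    tail = n_terms ** (1 - s) / (s - 1) + 0.5 * n_terms ** -s
    return head + tail

def quantum_integral(m, sign, upper=60.0, steps=200_000):
    """Midpoint estimate of the integral of x**m / (exp(x) + sign) from 0 to upper.

    sign = +1 for fermions, -1 for bosons; the tail beyond `upper` is ignored.
    """
    h = upper / steps
    total = 0.0
    for k in range(steps):
        x = (k + 0.5) * h
        total += x ** m / (math.exp(x) + sign)
    return total * h

# Values of zeta(n) rounded to three decimals
table = {s: round(zeta(s), 3) for s in (1.5, 2, 2.5, 3, 3.5, 4)}

# Closed forms of the integrals for m = 3
m = 3
boson = math.factorial(m) * zeta(m + 1)
fermion = (1 - 2 ** -m) * boson

# zeta(2n) from the Bernoulli numbers B_n = 1/6, 1/30, 1/42, 1/30
bernoulli = [Fraction(1, 6), Fraction(1, 30), Fraction(1, 42), Fraction(1, 30)]
zeta_even = [float(B) * (2 * math.pi) ** (2 * n) / (2 * math.factorial(2 * n))
             for n, B in enumerate(bernoulli, start=1)]

print(table)
print(quantum_integral(m, -1), boson)
print(quantum_integral(m, +1), fermion)
print(zeta_even)
```

For half-integral m the factorial is to be read as the gamma function $\Gamma(m+1)$, which `math.gamma` supplies; the midpoint rule used here is then less suitable for the boson case, since the integrand is singular at the origin.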

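(2) A Fermi expectation value

$$\begin{aligned}
\int_0^\infty\frac{f(x)\,dx}{e^{x-a}+1}-\int_0^a f(x)\,dx
&= \int_a^\infty\frac{f(x)\,dx}{e^{x-a}+1}-\int_0^a f(x)\,\frac{e^{x-a}}{e^{x-a}+1}\,dx \\
&= \int_0^\infty\frac{f(x+a)\,dx}{e^x+1}-\int_{-a}^0\frac{f(x+a)\,dx}{1+e^{-x}} \\
&= \int_0^\infty\frac{f(a+x)\,dx}{e^x+1}-\int_0^a\frac{f(a-x)\,dx}{e^x+1} \\
&= \int_0^\infty\frac{f(a+x)-f(a-x)}{e^x+1}\,dx,
\end{aligned}$$

neglecting

$$\int_a^\infty\frac{f(a-x)\,dx}{e^x+1},$$

so

$$\int_0^\infty\frac{f(x)\,dx}{e^{x-a}+1}-\int_0^a f(x)\,dx
= 2\int_0^\infty\left[x f'(a)+\frac{x^3}{3!}f'''(a)+\cdots\right](e^x+1)^{-1}\,dx.$$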
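
The leading term of this expansion can be tested numerically. By the result of section (1), $2\int_0^\infty x\,dx/(e^x+1) = \zeta(2) = \pi^2/6$, so the first term is $(\pi^2/6)f'(a)$. The sketch below compares it with a direct midpoint integration for the illustrative choice $f(x) = x^2$, $a = 10$; for this f the third derivative vanishes, so only the neglected exponentially small term separates the two. The function names, the cut-off and the step count are assumptions.

```python
import math

def midpoint(g, lo, hi, steps=200_000):
    """Midpoint-rule estimate of the integral of g from lo to hi."""
    h = (hi - lo) / steps
    return h * sum(g(lo + (k + 0.5) * h) for k in range(steps))

def fermi_excess(f, a, tail=60.0):
    """Integral of f/(exp(x-a)+1) over (0, inf), less the integral of f over (0, a)."""
    full = midpoint(lambda x: f(x) / (math.exp(x - a) + 1.0), 0.0, a + tail)
    filled = midpoint(f, 0.0, a)
    return full - filled

def leading_term(f_prime, a):
    """First term of the expansion: (pi**2 / 6) * f'(a)."""
    return math.pi ** 2 / 6 * f_prime(a)

a = 10.0
excess = fermi_excess(lambda x: x * x, a)
approx = leading_term(lambda x: 2 * x, a)
print(excess, approx)
```

The small residual difference is the neglected integral from a to infinity, which is of order $e^{-a}$ here.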

(3) Debye functions 

The phonons, or quanta of lattice vibration, in a crystal constitute a 
boson gas with no constraint on number, but with an upper limit on 
the single-boson energy. In the Debye approximation, this upper limit
is $k\theta_D$. The finite integrals required for the thermodynamic functions
are very well tabulated by Stark and Kortzeborn in University of 
California Radiation Laboratory Report, no. 17225 (1966). 

(4) Distribution and density functions

Given a series of events $E_r$ with a quantity $x_r$ associated with each,
the set of $x_r$ may be described by the distribution function $D_x(a)$,
which is the fraction of the total number of events for which $x_r < a$.
Then $0 \le D \le 1$.

Given many different series from the same class of events, the
individual distribution functions crowd towards a limit function as
the number of events in each series increases. $D_x(a)$ in this limit is
called the probability of (x < a).

$p_x(a) = (d/da)\,D_x(a)$ is the probability density function for x. In
physical measurement, $p_x$ commonly has one single maximum. The
p.d.f. is itself often called the distribution function, especially by
physicists.
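
The crowding of the distribution function towards its limit can be illustrated with a small sketch. The events here are assumed, purely for illustration, to be draws of a pseudo-random number uniformly distributed between 0 and 1, for which the limit function is $D_x(a) = a$ and the density is 1; the function names are hypothetical.

```python
import random

def distribution_function(xs, a):
    """Fraction of the events whose associated quantity x_r is below a."""
    return sum(1 for x in xs if x < a) / len(xs)

def density_estimate(xs, a, width):
    """Finite-difference estimate of p_x(a) = dD_x/da over a window of given width."""
    upper = distribution_function(xs, a + width / 2)
    lower = distribution_function(xs, a - width / 2)
    return (upper - lower) / width

rng = random.Random(0)   # fixed seed so that the run is repeatable
for n_events in (100, 10_000, 100_000):
    series = [rng.random() for _ in range(n_events)]
    print(n_events, distribution_function(series, 0.3))

print(density_estimate(series, 0.5, 0.2))
```

As the number of events in the series grows, the printed fractions settle towards 0.3, the probability of (x < 0.3) for this class of events.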

The moments of x are the expectation values of $x^n$ for integral n.
$p_x$ is often sufficiently characterized by only the first two moments,

$$\bar{x} = \int_{-\infty}^{\infty} x\,p(x)\,dx,$$

$$\overline{x^2} = \int_{-\infty}^{\infty} x^2\,p(x)\,dx.$$

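Note that the mean square deviation from the mean, which is the
most convenient measure of the dispersion of x, is given by

$$\overline{(x-\bar{x})^2} = \int_{-\infty}^{\infty}(x-\bar{x})^2\,p(x)\,dx
= \overline{x^2}-2\bar{x}\cdot\bar{x}+\bar{x}^2
= \overline{x^2}-\bar{x}^2.$$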



The normal distribution, which often represents a series of good
measurements of a single quantity, has p.d.f.

$$p(x) = \frac{1}{\sigma\sqrt{2\pi}}\,e^{-(x-a)^2/2\sigma^2},$$

where $\bar{x} = a$, $\overline{x^2}-\bar{x}^2 = \sigma^2$. The mean and the m.s. deviation, then,
suffice to determine the normal distribution function.
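
The statements about the normal distribution can be confirmed by direct integration of the p.d.f. The sketch below uses a midpoint rule over ten standard deviations either side of the mean; the values a = 2 and $\sigma$ = 0.5, the range and the step count are illustrative assumptions.

```python
import math

def normal_pdf(x, a, sigma):
    """Normal p.d.f. with mean a and m.s. deviation sigma**2."""
    return math.exp(-(x - a) ** 2 / (2 * sigma ** 2)) / (sigma * math.sqrt(2 * math.pi))

def moment(n, a, sigma, half_width=10.0, steps=20_000):
    """Midpoint estimate of the expectation of x**n over a +/- half_width*sigma."""
    lo = a - half_width * sigma
    h = 2 * half_width * sigma / steps
    total = 0.0
    for k in range(steps):
        x = lo + (k + 0.5) * h
        total += x ** n * normal_pdf(x, a, sigma)
    return total * h

a, sigma = 2.0, 0.5
norm = moment(0, a, sigma)          # total probability
mean = moment(1, a, sigma)          # first moment
mean_square = moment(2, a, sigma)   # second moment
ms_deviation = mean_square - mean ** 2
print(norm, mean, ms_deviation)
```

The total probability comes out as unity, the mean as a, and the difference between the second moment and the squared mean as $\sigma^2$.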

(5) Conditional probability 

Associate probabilities $p_{ij}$ (probability of $E_j$ and $H_i$) with joint events
constructed from classes E and H.

$$p(H_i \mid E_j) = \frac{p_{ij}}{\sum_i p_{ij}}$$

is the conditional probability of $H_i$, given $E_j$, and

$$p(E_j) = \sum_i p_{ij}$$

is the probability of $E_j$ for any H at all.
Then

$$p(H_i \mid E_j)\,p(E_j) = p_{ij} = p(E_j \mid H_i)\,p(H_i)$$

or

$$\frac{p(H_i \mid E_j)}{p(H_i)} = \frac{p(E_j \mid H_i)}{p(E_j)}.$$

This result is profitably used with a rather different interpretation 
of the symbols. 
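
A small table of joint probabilities makes these relations concrete. The numbers below are invented for illustration, and exact fractions are used so that the equalities hold without rounding; the function names are hypothetical.

```python
from fractions import Fraction

# Joint probabilities p[i][j] of (H_i and E_j); the values are illustrative.
p = [[Fraction(1, 10), Fraction(3, 10)],
     [Fraction(2, 10), Fraction(4, 10)]]

def p_E(j):
    """Probability of E_j for any H at all: sum over i of p_ij."""
    return sum(row[j] for row in p)

def p_H(i):
    """Probability of H_i for any E at all: sum over j of p_ij."""
    return sum(p[i])

def p_H_given_E(i, j):
    """Conditional probability of H_i, given E_j."""
    return p[i][j] / p_E(j)

def p_E_given_H(j, i):
    """Conditional probability of E_j, given H_i."""
    return p[i][j] / p_H(i)

for i in range(2):
    for j in range(2):
        print(i, j, p_H_given_E(i, j) / p_H(i), p_E_given_H(j, i) / p_E(j))
```

In each row of the output the two ratios coincide, as the last relation of this section requires.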

(6) Likeliness 

I regard an event E as having likeliness P when I am prepared to
wager P against $(1-P)$, on stakes I can afford to lose, on the
occurrence of E (and simultaneously, in the usual terms of a fair wager, to
bet $(1-P)$ against P on the non-occurrence of E). If events $E_j$ have
known probabilities (in the sense of limiting frequency fractions) $p_j$, I
must for consistency make $P_j = p_j$, and then actual relations among
p's must correspond to true relations among P's.
In particular

$$\frac{P(H_i \mid E_j)}{P(H_i)} = \frac{P(E_j \mid H_i)}{P(E_j)} \qquad \text{(Bayes equation)}$$

(7) Verification 

$H_i$ are a set of alternative hypotheses about a situation; $E_j$ are
alternative events occurring in the situation. The occurrence of a
particular $E_j$ lends support to $H_i$ in the sense that the likeliness of
$H_i$, given $E_j$, is to the a priori likeliness of $H_i$ as the likeliness of $E_j$,
given $H_i$, is to the likeliness of $E_j$ irrespective of hypothesis.

Very commonly, $H_i$ is precisely a statement about the $P_j$ (which,
if $H_i$ were true, would be a statement about the $p_j$). Then the
right-hand side of Bayes equation is known for each H and E. With luck,
$P(H_i)$, which is hard to estimate, is not vastly different for different
$H_i$.
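
The way in which successive events lend support to one hypothesis or another can be sketched as a repeated application of Bayes equation. The example is hypothetical: two hypotheses about a coin (fair, or biased four to one towards heads), equal a priori likeliness, and an invented sequence of tosses.

```python
from fractions import Fraction

def posterior(prior, likelihood, event):
    """Apply Bayes equation once.

    prior[i] is P(H_i); likelihood[i][event] is P(E_event | H_i).
    """
    joint = [prior[i] * likelihood[i][event] for i in range(len(prior))]
    p_event = sum(joint)   # likeliness of the event irrespective of hypothesis
    return [value / p_event for value in joint]

# Each hypothesis is precisely a statement about the probabilities of the events.
likelihood = [{"heads": Fraction(1, 2), "tails": Fraction(1, 2)},   # H_0: fair coin
              {"heads": Fraction(4, 5), "tails": Fraction(1, 5)}]   # H_1: biased coin

belief = [Fraction(1, 2), Fraction(1, 2)]   # equal a priori likeliness
for toss in ["heads", "heads", "tails", "heads"]:
    belief = posterior(belief, likelihood, toss)
    print(toss, belief)
```

Each toss multiplies the likeliness of a hypothesis by the ratio on the right-hand side of Bayes equation, so that a run of heads favours the biased coin and a tail restores some support to the fair one.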



FURTHER READING 



No book is likely to be so bad as to contain no useful knowledge. 
Here are a few really good ones. 

Information theory 

J. R. Pierce, Symbols, signals and noise (Hutchinson, 1962) — a first- 
class introduction. 

A. I. Khinchin, Mathematical foundations of information theory 
(Dover, 1957), and A. Feinstein, Foundations of information theory 
(McGraw-Hill, 1958) — excellent but not very easy mathematical 
books. 

Thermodynamics 

M. W. Zemansky, Heat and thermodynamics (McGraw-Hill, 1968) — 
an excellent undergraduate text. 

E. A. Guggenheim, Thermodynamics (North-Holland, 1957) — covers 
the whole field of chemical thermodynamics. Use a recent edition. 

P. T. Landsberg, Thermodynamics (Interscience, 1961) — concerned 
primarily with the foundations of the subject, and contains a great 
deal of statistical mechanics. 

Statistical mechanics 

E. Schrodinger, Statistical thermodynamics (Cambridge, 1952) — a 
little book full of lucid understanding. 

J. S. Dugdale, Entropy and low temperature physics (Hutchinson, 
1966) — particularly interesting on the low-temperature region 
where thermal behaviour depends strongly on quantum properties.

D. ter Haar, Elements of thermostatistics (Holt, Rinehart, Winston,
1966) — follows the great Dutch physicist H. Kramers in a very 
complete account at advanced undergraduate level. 

A. Katz, Principles of statistical mechanics (Freeman, 1967) — a good 
short book based on information theory, not going very far into 
applications. 

R. Kubo, Statistical mechanics (North-Holland, 1965) — a good book 
with a particularly valuable and extensive collection of worked 
problems. 

M. Tribus, Thermostatistics and thermodynamics (Van Nostrand, 
1961) — a large and very complete book based on the information 
theory approach due to E. T. Jaynes. 

R. H. Fowler, Statistical mechanics (Cambridge, 1936) and R. H. 
Fowler and E. A. Guggenheim, Statistical thermodynamics (Cam- 
bridge, 1939) — two large standard works which are still very 
useful. 

T. L. Hill, Statistical mechanics (McGraw-Hill, 1956) and K. Huang, 
Statistical mechanics (Wiley, 1965) — both excellent modern ad- 
vanced texts, at post-graduate rather than undergraduate level. 

L. D. Landau and E. M. Lifshitz, Statistical physics (Pergamon, 1969) 
— a master-work by the great Russian physicist and his colleague, 
with the emphasis on physical ideas but using a full range of 
theoretical techniques. 

G. S. Rushbrooke, Introduction to statistical mechanics (Oxford 
University Press, 1949) — an excellent text on traditional lines, rich 
in chemical applications, very slightly more mathematical than 
the present book, but extremely clear on many particular points. 

R. C. Tolman, Principles of statistical mechanics (Oxford, reprinted 
1959) — an old book now, but still with the best and clearest 
account of many points in classical and quantum mechanics as 
well as in statistical mechanics. 

The fundamental paper which establishes the viewpoint developed 
in this book is by E. T. Jaynes, in Physical Review 106, pp. 620-630, 
1957.